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Abstract. We consider semilinear evolution equations for which the linear 
>J ■ part generates a strongly continuous semigroup and the nonlinear part is suf- 

ficiently smooth on a scale of Hilbert spaces. In this setting, we prove the 
existence of solutions which are temporally smooth in the norm of the lowest 
rung of the scale for an open set of initial data on the highest rung of the 
scale. Under the same assumptions, we prove that a class of implicit, A-stable 
Runge-Kutta semidiscrctizations in time of such equations are smooth as maps 

^A ' from open subsets of the highest rung into the lowest rung of the scale. Under 

^* , the additional assumption that the linear part of the evolution equation is 

jT I ■ normal or sectorial, we prove full order convergence of the semidiscretization 

in time for initial data on open sets. Our results apply, in particular, to the 

1 ***^ ' semilinear wave equation and to the nonlinear Schrodinger equation. 
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1. Introduction 

We study numerical schemes for evolution equations on Hilbert spaces by first 
looking at the properties of a semidiscretization in time only; discretization in 
space is then treated as a perturbation within the Hilbert space setting. This 
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approach was introduced by Rothe [28]. When successful, results so obtained are 
naturally uniform in the spatial discretization parameter. In contrast, when first 
discretizing in space, the resulting finite dimensional system of ordinary differential 
equations (ODEs) can be treated with ODE-based techniques which avoids the 
difficulty arising from the analysis of equations on infinite-dimensional spaces, but 
where uniformity in the spatial mesh size is not immediate. 

In this paper, we consider semilinear evolution equations of the form 

d t U = AU + B(U) 

on a Hilbert space y. The linear operator A is assumed to generate a strongly 
continuous, not necessarily analytic semigroup and B is a bounded nonlinear op- 
erator on y. The examples we have in mind are semilinear Hamiltonian evolution 
equations such as the semilinear wave equation or the nonlinear Schrodinger equa- 
tion with periodic, homogeneous Dirichlet, or homogeneous Neumann boundary 
conditions, or on the line. However, the results in this paper do not depend on a 
Hamiltonian structure. 

We analyze the differentiability properties in initial value and time step of the 
scmiflow of the evolution equation and of a large class of A-stable Runge-Kutta 
methods, including the Gauss-Legendre methods, when applied to the evolution 
equation. To be able to differentiate the semiflow and the numerical method we 
formulate conditions that guarantee uniformity of the time-interval of existence 
(for the semiflow) and the maximum step size (for the numerical methods) over 
bounded sets of parameters. We present two versions of such uniformity results: 
Whenever existence can be achieved, uniformity holds on sufficiently small balls of 
initial data; we will label results of this type by "local version." Assuming more 
regularity for the initial data, we also obtain results which are uniform on bounded 
open sets so long as B is well-defined and bounded. We will label results of this 
type by "uniform version." 

Note that differentiation in time results in multiplication with the unbounded 
operator A and is only well-defined when considered as a map from a subset of 
D(A) to y; this is easily seen by taking B = and differentiating the exact semi- 
flow e tA U°. To be able to differentiate repeatedly in time we assume that B is 
C N ~ k as map from some open set T>k C 34 = D(A k ) to 34 for k = 0, . . . , K and 
N > K. Whether or not this condition is satisfied depends on the given evolu- 
tion equation, and in particular on the boundary conditions; it is satisfied for the 
equations mentioned above in the case of periodic boundary conditions and smooth 
nonlincaritics. We also give examples of PDEs with Neumann boundary conditions, 
Dirichlet boundary conditions, and on the line where this condition is true. We then 
prove that the semiflow of the evolution equation and the numerical method are of 
class C K jointly in time (resp. step size) and initial data when considered as a map 
from T>k to y. Both results require carefully tracking the domains of definition of 
B. Moreover, under the additional assumption that A is normal (or, more gener- 
ally, normal up to a perturbation which is a bounded linear operator on each of the 
34) or that A is sectorial, we show convergence of the scmidiscrctization in time at 
its full order p provided K = p and for initial data U £ T>k+i- 

The exact solution U(t) of the semilinear evolution equation is obtained as a 
fixed point of a contraction map. Similarly, the Runge-Kutta methods we consider 
are implicit as they are functions of the Runge-Kutta stage vectors, which in turn 
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are obtained as fixed points of contraction maps. As for the exact solution, differ- 
entiation in the step size of the Runge-Kutta method results in multiplication by 
the unbounded operator A. Hence, these derivatives are also only well-defined on 
the scale of Hilbert spaces 34- An additional difficulty arises from the fact that 
the semiflow, the numerical method, the contraction maps for scmiflow and stage 
vectors, and their derivatives with respect to the initial data are only strongly con- 
tinuous in the time-like parameter, but not continuous in the operator norm. Hence, 
these maps do not fit into the usual setting of contraction mapping theorems with 
parameters. We therefore address these issues by providing an abstract theory for 
the differentiability properties of fixed points of contraction mappings on a scale of 
Banach spaces. This theory provides a unified framework for the time-continuous 
and time-semidiscrete case. 

Let us mention some related results in the literature. Le Roux [23] studies con- 
vergence results for strongly A-stablc approximations S(/iA) of holomorphic semi- 
groups e on Hilbert spaces, an example of which are strongly A-stable Rungc 
Kutta methods applied to linear parabolic systems. Palcncia [26] and Crouzeix et 
al. [1 2] study stability of A-acceptable rational approximations S(/iA) of holomor- 
phic semigroups c hA on Banach spaces; they show that when Re(specA) < ui for 
some u > 0, then ||S™(/iA)|j < 6 S e^ nh for some w s > 0, 6 S > 0, and all n E N. 
Lubich and Ostermann [24] prove convergence results for Runge-Kutta methods 
applied to semilinear parabolic equations on Banach spaces, cf. [11]. Variable step 
size schemes applied to fully nonlinear parabolic problems have been studied in [16]. 
Gonzalez and Palencia [17] study stability of A-stable Runge-Kutta methods in the 
initial value, as we do, but they study quasilinear parabolic problems and do not 
consider the differentiability properties of the solution. Akrivis and Crouzeix [3] 
discuss multistcp scmidiscretizations in time for parabolic problems; see references 
therein for further related work. 

In [21], quoted above, Hersh and Kato prove convergence of A-acceptable ra- 
tional approximations S(hA) of non-analytic Co-semigroups c for smooth initial 
data. Brenner and Thomee [6] show that A-acceptable rational approximations 
S(hA) of non-analytic Co-semigroups e hA with Re(spec A) < in general grow like 
||S n (/iA)|| = C^rt 1 / 2 ) and study fractional order convergence for non-smooth initial 
data of linear evolution equations, see also [22]; for extensions to variable step size, 
sec [4] . Brenner et al. [7] study convergence of rational approximations of inhomo- 
geneous linear differential equations on Banach spaces, assuming stability of the 
approximation. Colin et al. [8, 9] study modified Crank-Nicolson scmidiscretiza- 
tions in time of nonlinear Schrodingcr equations and Zakharov wave equations. 
They prove convergence as h — > 0, but do not analyze the order of convergence. 

In this paper we have a related, but different objective. Similarly as in [6, 21, 22] 
we consider semidiscretizations in time of evolution equations which are not para- 
bolic, i.e., equations whose linear part does not generate an analytic semigroup. But 
whereas the main issue in [6, 21, 22] is that the spectral theorem is not available for 
the linear operator A so that stability estimates of the form ||S™(/iA)|| < Os e" sn/l , 
as required for the standard convergence analysis, are not available, we assume here, 
like [7], that this estimate holds true, e.g. due to normality of A on the Hilbert space 
y. Our focus is rather on semilinear problems as were considered by Lubich and 
Ostermann [24] in the parabolic case; the class of Runge-Kutta schemes considered 
here is the same as in their work. However, while [24, 111 assume the existence of 



4 M. OLIVER AND C. WULFF 

a temporally smooth solution U(t) of the semilinear evolution equation (or a per- 
turbation of it) to obtain higher order convergence, we provide a detailed analysis 
under which conditions this assumption holds true. 

Our conditions on B yield, in particular, C K smoothness of the scmiflow jointly 
in time and in the initial data for initial values in an open set of a Hilbert space 
~y>K- If the conditions on B are not satisfied, the set of initial values of temporally 
smooth solutions is generally a complicated set which is characterized by nonlinear 
conditions; hence such initial data are in general difficult to prepare numerically. 
We illustrate this for the semilinear wave equation with generic nonlinearity and 
Dirichlct boundary conditions; see Section 2.5.3. Under the same conditions on B, 
differentiability of the numerical method in the step size h and in the initial data 
holds on open sets. This allows us to prove convergence of the numerical method 
without additional stage order conditions as have been assumed in [24]. Moreover, 
we obtain full order convergence, whereas e.g. the convergence results for semilinear 
PDEs of [24] provide an order of convergence that is determined by the stage order 
and that, in general, is smaller than the order of the method. 

Lubich and Ostermann [24] , in the parabolic setting, also obtain convergence re- 
sults when only a perturbation of the continuous solution U(t) is temporally smooth, 
and their estimate of the trajectory error then also depends on this perturbation 
error. In practice, such a perturbation would typically be a space discretization; if 
the continuous solution lacks temporal smoothness, the assumption of a temporally 
smooth solution of a perturbation tending to zero with the step size h typically 
imposes mesh conditions that exclude order p convergence of the semidiscretization 
in time. 

In contrast to [6, 24], our interest in this paper is not on fractional order con- 
vergence for non-smooth initial data. Rather, since we are interested in obtaining 
higher order differentiability of the numerical method in the time step, we restrict 
attention to regular initial data U £ !Vif+i; in particular, we assume enough reg- 
ularity to have full order of convergence, i.e., K > p where p is the order of the 
numerical method. Our convergence result extends the corresponding classical re- 
sult for linear evolution equations of Hersh and Kato [21] to nonlinear systems. 

There has been a lot of recent activity in the application of split step time- 
semidiscrctizations of nonlinear Schrodinger and wave equations: Bcsse et al. [5] 
and Lubich [25] study convergence of split step time-semidiscretizations for nonlin- 
ear Schrodinger equations; also sec [18] for a general framework in the linear case 
and more references, and [15, 14] for long-time preservation of actions of nonlinear 
Schrodinger equations under split step timc-scmidiscrctizations. While splitting 
methods arc very effective for simulating evolution equations for which the linear 
evolution c tA can easily be computed explicitly, Runge-Kutta methods are still 
a good choice when an eigendecomposition of A is not available, as for example 
for the semilinear wave equation in an inhomogeneous medium; see Section 2.5.5. 
Moreover, the simplest example of a Gauss-Legendre Runge-Kutta method, the 
implicit mid point rule, appears to have some advantage over split step time- 
semidiscretizations for the computation of wave trains for nonlinear Schrodinger 
equations [20, 32] because the latter introduce an artificial instability while the 
former reproduces recurrences well. 

In this paper, we shall hence restrict our attention to Runge-Kutta methods 
which have a long history as robust and effective time integrators for both ODEs 
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and PDEs; see, e.g., [29, 30]. Gauss-Legendre Runge-Kutta methods, in particular, 
have attracted attention as they are symplectic and yield multisymplectic space- 
time schemes for PDEs [2]. 

While we restrict attention to evolution equations on Hilbcrt spaces y, our results 
on differentiability of the semiflow and of the numerical method also hold true when 
y is a Banach space. However, the stability condition ||S n (/iA)|| < <d$ c nuJsh , which 
we need for our convergence result, is quite restrictive in the Banach space setting, 
as discussed above. 

The paper is organized as follows. In Section 2, we introduce the class of semi- 
linear evolution equations considered, and study the differentiability properties of 
the semiflow of these evolution equations. We also present a general result on 
the differentiability of superposition operators. We then show how the semilin- 
ear wave equation and the nonlinear Schrodinger equation with different types of 
boundary conditions fit into this framework. In Section 3, we derive corresponding 
statements on the wcll-posedness, differentiability properties, and convergence of 
A-stablc Runge-Kutta methods when applied to such evolution equations. In the 
appendix, we present a number of technical results, most notably a contraction 
mapping theorem on a scale of Banach spaces, which arc needed in the main body 
of the paper. 

2. SEMILINEAR EVOLUTION EQUATIONS 

In this section, we set up the framework for a class of scmilinear evolution equa- 
tions whose time discretization we analyze subsequently. After introducing some 
notation (Section 2.1) and setting up the general functional framework in Sec- 
tions 2.2, we provide a setting in which the semiflow is differentiable with respect 
to the initial data as well as time (Section 2.3). In many examples, the nonlineari- 
ties are superposition operators of nonlinear functions; we collect their fundamental 
properties in Section 2.4. These results enable us to fit our two main examples, 
the semilinear wave equation (Section 2.5) and the nonlinear Schrodinger equation 
(Section 2.6), into the abstract framework. 

2.1. Some notation. Let y be a Banach space. We write 
B%(U°) = {U€y:\\U-U%<R} 

to denote the closed ball of radius R around U° G y. (If no confusion about the 
space is possible, we may drop the superscript, or write Br(Uq) C y instead of 
BipiUo).) Let T> C y be open. We denote by T)~ s some open subset of y which 
satisfies 

V s = {UeV: disty(U,dV) >S}, (2.1) 

where disty(U,V) = miw^T>\\U — W\\y denotes the distance between a point U E y 
and the set V C y measured in the J^-norm. 

For Banach spaces X and y, and j G No, we write £i(y, X) to denote the vector 
space of j-multilinear bounded mappings from y to X; we set £ 3 '(X) = &{X,X). 

For Banach spaces X, y, and Z, and open subsets U C X, V C y, and W C Z, 
we write 

FeC M (MxV;W) 
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to denote a continuous function F : U x V — > W whose partial Frechet derivatives 
D^-DyF(X, y) exist and are such that the maps 

(X,Y,X l ,...,X i )^V i x T>iyF{X,Y){X 1 ,...,X i ) (2.2) 

are continuous from U x V x X 1 into £ J (3^, -E) for i = 0, . . . , m and j = 0, . . . , n. In 
particular, all directional derivatives are continuous. We write 

FeC[ Bn) (WxV;W) 

if, in addition, the partial Frechet derivatives are bounded and the maps (2.2) 
extend continuously to the boundary. (The latter is important as we will apply the 
contraction mapping theorem to maps in such classes.) As usual, we write 

FeC (m ' n) (UxV;W) 

to denote that the partial Frechet derivatives up to order (m, n) exist and are con- 
tinuous in the norm topology; we write C^ n ' n> if these derivatives are, in addition, 
bounded and extend continuously to the boundary. If any of the sets is not open, 
we define 

C {m - n) {U x V; W) = C (m ' n) (int(W) x int(V);int(W)) , 
where \nt(U) denotes the interior of U, with analogous notation for the Cb-spaces. 
The spaces C m (U; W) and Cg{U\ W) are defined likewise. 

Note that C^' n) (U xV;W) = & m ^{U x V; W) only if X is finite-dimensional. 
In general, 

Ci m ' n) (UxV;W)D C^ ,n) (U x V; W) n C^ n+1) (U x V; W) (2.3a) 

because any differentiable function is continuous. Moreover, 

C b fl ' fe) (WxV;W)= C b °' fc) (W x V; W) . (2.3b) 

In the above, V will typically be some interval of time. 

2.2. General setting. We consider semilinear evolution equations on a Hilbert 
space y, 

d t U = F(U) = AU + B(U) , (2.4) 

where U : [0,T] — > y. Equation (2.4) formally looks like an ODE, but will be 
thought of as being posed on an infinite-dimensional function space y. 
Our main examples are the following. 

Example 2.1 (Semilinear wave equation). For the semilinear wave equation 

d u u = d xx u - f(u) , (2.5) 

we write v = d t u and U = (u, v) T which, for t fixed, shall be an element of a Hilbert 
space y to be specified later, so that 

° «) ^ m - (_;,„ 



a . , and B(U) = " . (2.6) 



Example 2.2 (Nonlinear Schrodingcr equation). For the nonlinear Schrodinger equa- 
tion 

i d t u = -9 K2; u + duV (u, u) , (2.7) 

we set U = u, so that 

A = id 2 x and B(J7) = -i^j-F(u,u) . (2.8) 
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In the following, we introduce the framework in which we obtain smooth solutions 
of (2.4). Later, in Sections 2.5 and 2.6, we show how the semilinear wave equation 
and the nonlinear Schrddingcr equation as formally introduced above fit into this 
framework. It is well known that the following conditions imply the existence of a 
scmiflow of (2.4). 

(AO) A is a closed, densely defined linear operator on y and generates a Co- 
semigroup on y. 
(BO) B : V —> y is Lipshitz on some open set V C y. 

For the definition of strongly continuous semigroups (Co-semigroups) and detailed 
proofs, see, e.g., [27]. For our purposes, the main points can be summarized as 
follows. 

Condition (AO) implies, in particular, that there exist constants ui and such 
that for every t > 

||e* A ||<ee w * (2.9) 

with Re(spec A) < u>. Moreover, for every A <E C with Re A > w, 

ll(A-An|<^-. (2.10) 

After reformulating (2.4) in its mild formulation 

U(t)=e tA U°+ f e(*- s ^B(C/(s))ds, (2.11) 



the contraction mapping theorem applies and we obtain local-in-time well-posedness 
of our abstract semilinear evolution equation. 

Let $' denote the semiflow of (2.4), i.e. the map U° i-» $'(J7°) such that U(t) = 
$*([/°) satisfies (2.11) with C/(0) = U°. We sometimes write $(U°,t) in place of 
$*. When U° G D(A), then t i-> $*([/") is differentiable. 

2.3. Regularity of the semiflow. When B = 0, then t — > $*([/) is fc-times 
differentiable as a map from D(A k ) to y for every k G N. In this section, we 
extend this result to semilinear evolution equations under suitable assumptions on 
the nonlinearity B and provide bounds on the derivatives. 
For k £ N , we define 

y k = D{A k ) 

endowed with the inner product 



(Ul,U 2 )y k - (AU 1 ,AU 2 )y k _ 1 + (U ll U 2 )y k _ 1 . (2.12) 



Then 



\\A\\y t+1 _+ ye <l and \\U\\ yt < \\U\\ ye+l (2.13) 

forallt/e^+i. 

To state a differentiability result for higher time derivatives, we need the following 
specific assumptions on the regularity of B on the scale JVj ■ The same assumptions 
will also be required for the convergence analysis of A-stable Runge-Kutta schemes 
in Section 3. 



(Bl) There exist K S No, N <G N with N > K, and a nested sequence of 34; - 

nN 

'b 



bounded and open sets T>k such that B £ C^ k {T>k',yk) for k = 0, . . . , K. 
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We denote the bounds on B: T>k —t 34 and its derivatives by constants Mk, 
M k , etc., for k = 0, ...,K, and identify M = M , M' = M' , and V = V . In 
addition to the domains X>o, . . . , T>k defined in this assumption, we will sometimes 
need to refer to T>k+i, which may be any subset of T>k which is bounded and open 
in yx+i- 

Wc note that nonlinear continuous operators B G C(T>\ y) do not generally map 
closed bounded sets into closed bounded sets, sec Remark 2.3 below. However, the 
boundedness requirements can always be met on balls inside the domain of B, i.e., if 
B is C n from some open set V C Z to Z then, by continuity, for every U°eD there 
is some R > such that B: Br(U°) cP->Z and its derivatives are uniformly 
bounded so that B G Cg(B R (U );y). 

Remark 2.3. The existence of continuous unbounded nonlinear functionals on an 
infinite dimensional Banach space X can be seen by the following construction. It 
is a standard result that there exists exists a sequence Xj G X such that ||xj|| = 1 
and \xj — Xfc|| > 3/4; on a Hilbcrt space, an orthonormal basis will do. Now let 
hj G C(X,M.) have support on B*, 4 (xj) with hj(xj) = 1. Then F defined by 

oo 
3=0 

satisfies F G C(X,M), since we have hj(x) = for all but at most one j. But F but 
does not map the closed bounded set S 1 c '(0) into a bounded set. 

Superposition operators of smooth functions / : DcR^ M m on Sobolcv spaces 
as occur in Examples 2.1 and 2.2 above are bounded; for details see Sections 2.4-2.6. 
Indeed, for superposition operators condition (Bl) holds for all nested sequences of 
34-bounded and open sets T>k C On 34, where V is a 3^-bounded and open set, see 
Theorem 2.12. In our examples in Sections 2.5 and 2.6 we choose V^ = X>n£>jj (0) 
for some R > 0. Then (2.13) ensures that V k+1 c V k for < k < K and (Bl) 
holds. 

Given S > and a hierarchy of open sets T>( C 3^ for t = 0, . . . , L for L G N 
with £> = V, we define V Q S = V- 5 as in (2.1) and, for I = 1, . . . , L, 

V~ s = {U e V e : dist y (U,dV e ) > 5} . 

Then, by construction and due to (2.13), Bf e (U) C V e for all U G VJ 5 and 
i = 0, . . . , L and Vj s c Vj\ for £ = 1, . . . , L. 

Under assumptions (A0) and (Bl), the semi-flow $* of (2.4) exists on each 34- 
In the following, we show that a time derivative of order £ maps £ rungs down this 
scale of Hilbert spaces. 

Theorem 2.4 (Regularity of the scmiflow, local version). Assume (A0) and (Bl). 
Choose R > such that V K R ^ and pick U a G V K R . Let R* = 72/(28) with Q 
from (2.9). Then there is T* = T*(R, U°) > such that the semiflow (U, t) t-> $*([/) 
o/ (2.4) satisfies 

$G f) C^(££(^)x[0,Tj;fl£ fc -'(l7 )). (2.14a) 

j+k<N 
l<k<K 

In particular, 

* eCf (B%(U°) x [0,T,];^(C/ )). (2.14b) 
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The bounds on $ and T* depend only on the bounds afforded by (Bl), (2.9), R, and 
U°. 

Proof. Writing t = tT for some fixed T > 0, we see that a solution to the mild 
formulation (2.11) is a fixed point of the map 



H(W; U, T){t) = e TTA U + T f e (r " a)TA B(W(a)) da , 

Jo 



(2.15) 



This reformulation is useful because we want to quote the contraction mapping 
theorem on a scale of Banach spaces, Theorem A. 9, to prove the differentiability 
properties of $ as claimed. We work on the scale Zj — Cb([0, l];3^j) and seek a 
fixed point of II in Wj = Cb([0, 1]; B R 3 (U )) for j = 0, . . . ,K, with parameter sets 
U = int(B%K(U )) C X = y K and 1 = (0,7;). Clearly, n maps W 3 x U x X into 
Zj. To bound the range of II, we estimate, for j = 0, . . . , A", 

||n(W;tf,T)-[/°|| y . 

< \\^ TA U° U°\\ yj + \\e^ TA (U U°)\\ yj +TJ\\^-^ TA B{W{a))\\ y] da 

< \\c TTA U Q -U \\ yj +ee" T A*+T6e wT AA-. (2.16) 

With the choice A* = A/28, we observe that for sufficiently small T* and all 
T G [0, T*] the right hand side can be made less than A for j = 0, . . . , K independent 
of r G [0, 1]. We can thus take the supremum over r G [0, 1], which altogether proves 
that II maps Wj x U x 2" into Wj. Condition (i) of Theorem A. 9 then follows from 
our assumptions on A and B. 
Similarly, we estimate 

\\V w n(W;U,T)\\ £iCbi[0il] . y . )} <T@e» T M' j: (2.17) 

so that II is a uniform contraction for all U € U, W € W, and T G T = (0, T») with 
a possibly smaller value of T*. Here we used that B is at least C 1 on the highest 
rung of the scale due to the requirement that N > K in (Bl). Hence, condition (ii) 
of Theorem A. 9 is verified. 

Theorem A. 9 then implies that the fixed point W of n satisfies 

WG p| C^ £) (A^([/°)x[0,T.];W fe _,). 

j+fc<JV 
e<k<K 

To infer (2.14a), we recall that $ tT ([/) = W(U,T)(t); hence d^d^ T (U) = 
d^d^W(U,T)(l). Finally, (2.14b) follows from Lemma A.2. □ 

Remark 2.5. With the choice of norm (2.12), the fundamental estimates in this pa- 
per which carry named constants, in particular in (2.20) and A, c$ in Lemma 3.11, 
are the same on all 34 for k G No as these constants are norms of operators like e tA 
which commute with A. Thus, if 34 for k G N were endowed with a different, but 
equivalent set of norms, these and consequent constants would need to be adopted 
and possibly become dependent on the rung. 

Theorem 2.4 does not guarantee that the time of existence of the solution can be 
chosen uniformly over T> or even over T>~ for some S > 0. The following theorem 
shows that such uniformity can be obtained along with improved regularity over 
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bounded domains other than balls on the expense of requiring the initial data to 
lie in a set one step up the scale. 

In the following, we suppose that T>~^ +1 C TJ^ is 3^"+i-bounded and open and 
define 

R K+1 = sup \\U\\y K+1 . (2.18) 

UGV~ 5 

Theorem 2.6 (Regularity of the semiflow, uniform version). Assume (AO) and 
(Bl). Choose 5 > small enough such that T>x+i ^ ^- Then there exists T* = 
T*(<5) > such that the semiflow (U,t) t-> $*(£/) of (2.4) satisfies (2.14) with 
uniform bounds for all U° G T>J C+1 , with R = 5, and such that 

*e f) C b ^(^ +1 x[0,Tj;^). (2.19a) 

j+k<N 
£<k<K+l 

In particular, when N > K + 1 . 

$ G C* +1 (V K S +1 x [0,T*];£>) . (2.19b) 

TTie bounds on $ and T* depend only on S and on the bounds afforded by (Bl), 
(2.18), and (2.9). 

Proof. We apply Theorem 2.4 for each U° G T^k & +i witn R = 8. We note that in 
the proof of Theorem 2.4, even in the case K = 0, the guaranteed time of existence 
T* cannot be chosen uniformly for U° G T>~ s because the first term on the right of 
(2.16) cannot be made uniformly small. However, we may alternatively estimate, 
using (2.13) and (2.18), that for j = 0, . . . , K 



U°-U°\\ v <T max |Mc tA C/°|| v <TQc" T R 1+1 . (2.20) 

y i tS[0,T] " y i J 



Inserting this estimate into (2.16), we see that we can choose T* > small enough 
such that n( ■;U,T) maps Wj = B% (U°) into itself for all U° G V K S +1 and T G 
[0, T*]. Following the proof of Theorem 2.4, we find that (2.14a) holds with uniform 
bounds for all U° G T>^ +1 when R = S, thereby implying 



$G f) c£' ) (p^ +1 x[0,T*y,'D k -t) (2.21) 

j+k<N 
£<k<K 

with bounds which only depend on the bounds afforded by (Bl), (2.9), (2.18), and 
on S. Next, we prove that $ maps into a space one step up the scale, namely 

A$G p| cg A (p]f +1 x[0,T<]',y k -t). (2.22) 

j+k<N-l 

e<k<K 

Note that by [27, Theorem 6.1.5], a mild solution U{t) of (2.4) satisfies U(t) G 
D(A) if [/(0) G D(A) and B G C l {V,y)- thus, the formal identity AW{t)/At = 
T{AW(tT) + B(W(t))) for W(t) = U(tT) holds true. Hence, by applying A to 
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the fixed point equation (2.15) and integrating by parts, we find 

AW{t) = c tTA AU + T f Ac { - T - a)TA B{W{a)) da 
Jo 

= e TTA {AU + B{U)) - B(W(t)) 

+ T [ c ( - T -^ TA DB(W(a))(AW(a) + B(W(a))da . 
Jo 

This is a linear fixed point equation 
W = Ii(W, U, T){t) = e TTA (AU + B{U)) - B(W{r)) 

+ T f e ( T -^ TA DB(W(a))(W(a) + B{W{a)) da (2.23) 
Jo 

for W(U, T) = AW(U, T). We consider the fixed point equation (2.23) for W = AW 
with Wj = B r z . (0) for j = 0, . . . , K with r > big enough such that II maps each 
VVj into itself. Applying Lemma A. 6 (chain rule on the scale of Banach spaces) 
and Lemma A. 7 to the right hand side of the fixed point equation (2.23), we verify 
once more the assumptions of Theorem A. 9 with N replaced by N — 1. This yields 
(2.22). 

It remains to be shown that we can translate improved spatial regularity into 
differentiability in time by invoking the scmilincar evolution equation (2.4). Due 
to (2.21), Lemma A. 6 implies that B o $ is in the same class (2.22) as A<& and, 
since <9 t $ = A$ + B o $, so is 9 t <&. Combining this result, (2.21), and (2.22) via 
Lemma A. 4 implies (2.19a). 

Finally, (2.19b) follows from Lemma A. 2 with K replaced by K + 1. □ 

Remark 2.7. It is worth noting that, even though we find that $* maps into y^+i, 
the proof, being based on the fixed point problem (2.23), requires B to be defined 
only up to rung K. The same pattern occurs when studying the Runge-Kutta 
numerical time-ft. maps in Section 3. 

Remark 2.8 (Image of semiflow). The proof of Theorem 2.6 shows that, actually, 

$ e C^^'Va'+i x [o,TJ ; v K ny K+1 ) 

and 

$e f) C^\v- K 5 +1 x[Q,T^-D k _i). 

j+k<N 

£<k<K+l 

(k,£)jt(K +1,0) 

Remark 2.9. If A — —A* is skew-symmetric on the Hilbert space y, as for the 
nonlinear Schrddinger equation (see Section 2.6), then A is normal, iA is self-adjoint 
and, by Stone's theorem, generates a unitary group e . More generally, if A is skew- 
symmetric up to a perturbation which is bounded on all y k , as for the semilinear 
wave equation (see Section 2.5), then A generates a C group e tA on each y k . In 
both cases, (2.9) and (2.10) may be replaced by the following statement: There 
exist a constant u> with | Re (spec A) \ < lo and a constant such that for every 
t £ M. and for every A e C with |Rc \\ > uj 

e 



e 



11 <ee w|i| , IKA-A)- 1 !! < 



ReAl 
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see [27]. Then the semiflow $' is also a flow with interval of existence [— T*, T»] and 
regularity as specified in Theorem 2.4 and Theorem 2.6. 

2.4. Superposition operators. To study the well-posedness of evolution equa- 
tions such as (2.5) and (2.7), we need to consider superposition operators f'GtC 
Hi — ► 'Hi of functions f:GcR d ^ R d . This concept is widely used; see, e.g., 
[19, 27] for specific examples. In this section, we characterize superposition opera- 
tors in sufficient generality for later use. 

Let /= [oi] cMbea bounded closed interval. We write / Hf(/;]R d ) to denote 
the Sobolev space of functions u : I — > M. d whose weak derivatives up to order £ are 
contained in L^iJ', R d ). 

Lemma 2.10 ([1]). The space He(I;M) is a topological algebra for every £ > 1/2. 
Specifically, there exists a constant c — c(£) such that for every u,v £ 7^(7; R) the 
product uv £ Wi{I;'R) satisfies 

\\uv\\u t (iM) - c Mudi-M) Mudim ■ ( 2 - 24 ) 

Armed with this result, we can characterize more general superposition operators 
where a function /: G — > W n for some open G C R d induces a mapping u n> f(u) 
between function spaces. The /cth derivative of / as a function on R d is a k- 
linear map on M. d . As such, it induces a k- linear superposition operator between 
function spaces. A priori, it is not clear whether the fcth Frechet derivative of the 
superposition operator of / equals the superposition operator of the fcth derivative 
of / on M d . The following lemma and theorem provide a setting in which this is 
true, so that we use the symbol D fc / for both these objects. 

Lemma 2.11. Let G C M. d be open, let f <E C^(G;W n ) for some N e N , and set 

g = {ueC(I;R d ): u(I) cG}. 

Then f £ C^(0,C(I;W n )) and the derivatives of f as an operator from C(I;R. d ) 
to C(/;R m ) are the superposition operators of the derivatives of f as a function on 

R d . 

Proof. We proceed iteratively for n = 0, . . . , N. The Taylor theorem with integral 
remainder asserts that for fixed zq £ G 



/ 0) - 2^ — j\ — ( z ~ z °) 



<p(z ,z)\z-z \ n (2.25) 



p(z , z) = - max |D B /(zo + 9{z - z )) - D"/(« ))| 



j=0 

(when d > 1, D 4 / is an i-linear map acting on the tensor product (z — zq) 1 ), where 

1 
n\ 9e[o,i] 

is continuous in zq, z £ G and uniformly continuous for Zq, z £ K whenever K C G 
is compact. 

We now fix uq £ Q and let u £ Q. Clearly, uo(I) and u(I) are compact subsets of 
G, so that, setting zq = Uo(x) and z = u(x) in (2.25), we may take the supremum 
over x £ /, thereby obtaining 

4 =0 C(7;R-) 
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Since p(uo,uq) = 0, this proves that / G C n (Q;C(I;M. m )) and identifies the Frechet 
derivative of order n as the superposition operator of the derivative of order n on 
R d . 

Since / G C b (G,R m ), the set f(Q) is a bounded subset of C{I;M. m ). Moreover / 
extends continuously to the boundary of Q since / : G — > M m does. 

To prove boundedness of D k f as a map from Q to £ k (C(I;R d ),C(I; W n )) for 
k = 1,...,N, we employ its identification with the superposition operator of the 
k- linear map D k f on M d and estimate 

k 

||D fc /(«)(«i, ■ • •,n k )\\ cmm) < c l|D fe /(«)ll c(7;Kmdfc) nil^Hc(7;^) ( 2 - 26 ) 

4=1 

for some c > 0, noting that D fe / G Cb(G',C(I; W nd )) by the argument for the case 
k = 0. D 

The corresponding result on the Sobolev scale is as follows. 

Theorem 2.12. Let f G Cg (G;R m ) for some N £ N and open set G C R d . For 
each £ = 1, . . . , N, let Qi denote an TLi-bounded and open subset of Q n 'H.iiJ; R d ) 
with Q as in Lemma 2.11. Then 

feC^(g i; C 2 (L;R m ))n fl (%(g t ;Hi(I;R m )). 

k+l<N 

e>i 

The derivatives of f as an operator on Tie are the superposition operators of the 
derivatives of f as a function from R d to R m . 

Proof. The statement / S C? (Q\\ C%{iy) is a direct consequence of Lemma 2.11 
and the continuity of the embeddings %i(/;R m ) C C(L;W n ) C £ 2 (L;«. m ). (The 
first inclusion is due to the Sobolev embedding theorem.) 

Next, we show that / G C^(G;M. m ) is bounded as an operator from Qg to 
He(L; R m ) for 1= 1, . . . , N. We proceed inductively in £. Since, for some CV > 0, 

IMk < ft (HIh^ + IKIUJ (2-27) 

for w G T-Le(L), the inductive step is achieved by taking w = f(u) and showing that 
||9;c/(u)||"H{_i is bounded over u € Qi- Indeed, when £ = 1, ||/(u)||.C 2 is uniformly 
bounded in u G Q\ by the argument above, and there is a constant c\ > such that 

ll^/( W )ll£ 2 (/;K".) ^ C l ll D /(«)ll C (/;R*») W U ^\c 2 (I;M") 

is uniformly bounded for u G Q\ by Lemma 2.11. We conclude that / is bounded 
as map from Qi to TCi(L;W n ). When £ > 2, applying the algebra inequality (2.24) 
component-wise, we estimate 

\\d x f(u)\\ ne _ i{I . Rm) < c 2 ||D/(w)||^_ i(/;Rdm) K||^_ i(7;Rd) , 

for some constant c 2 > 0, where the right side is uniformly bounded for u £ Qi since 
\\^ f ( u )\\n e -i(i ,^ dm ) ^ s uniformly bounded for u G Qi-\ by induction hypothesis. 
Thus, by (2.27) with w = f(u), using the induction hypothesis once more, we 
obtain boundedness of /: Gt -)■ %(/;R m ). 
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To prove continuity and continuous differentiability of / : Qn — > He, we introduce, 
for fc = 0,...,iV-l, 

k 

F k (u , u) = /(«) - £ ^IIM („ _ Uo )« 



and write d x F)~(uo,u) in the form 



9xF fc = 



X^ D i+1 /(«o) 



D fe+ _V( Uo ) 
fc! 



9xU + - / v " u; (« - u ) fe 9 x (u - uo) 



D/(«)-^ ^(«-«o) ! 

L «=0 

When £ = 1, we estimate for every fc = 0, . . . , N — 1, using Lemma 2.11 and the 
Sobolev embedding theorem again, that, for u £ Qi, 

k 



W d * F l°\\Hi-i(W m ) - C3 



D /( w ) - 2^ fi ( u ~ M °) 



C(J) 



4=0 

+ ° 3 fc! H Dfc+1 ^ u °)llc b (/) H« ~ u o\\c(i) ll u - "olU ( /) 
<a(uo,u)\\u-u \\ nt{I) (2.28) 

for some a £ C(Qi x Qf, Rq~) with c(uo, u) = and some constant C3 > 0. Moreover, 
since / £ C N (Qi;C2), there exists a function u £ C(Qi x £/i;Rq") with u(uq,u) = 
such that ll^-lUa -= w ( w 07"«) ||u — Mo||^ • Hence, (2.27) with w = Fk(u , u) implies 
f£C N - 1 {g 1 ;U l ). 

When £ > 2, we obtain, by applying (2.24) recursively and component-wise to 
the second term of d x Fk, an estimate as on the first and second line of (2.28) 
with He-i(I) in place of C(I) for every fc = 0, . . . , N — I. Applying the induction 
hypothesis to both / and D/ shows, as before, that / £ C N ~ t (Gf,'Hi) and that its 
derivatives are the superposition operators of the derivatives of / as a function on 
R d . 

Due to this identification, we can prove boundedness of D fe / as a map from Qi 
to £ k (Hi (/;R d ),7^(/;R m )) by applying (2.24) recursively and component-wise to 
D k f(u)(ui, . . . , Mfc). In this way we obtain an estimate of the form (2.26) with He 
in place of Cb- The bound is then achieved by noting that D k f : Qt -)■ Ht(I; R md ) 
is a bounded operator by the argument provided earlier in this proof for k + £ < N. 

Finally, we need to show that B k f: Q t -> £ k (He{I;M d ),Ht(I;W n )) extends 
continuously to the boundary of Qe when k + i < N. For k = this follows 
recursively from (2.28) and / £ C^(Q;C(I)) as above. Applying this result to 
D fc /: Qi — ► He(I;M. md ) and using once again the identification of derivatives of 
the superposition operator with the superposition operators of the derivatives, we 
complete the proof. □ 

2.5. Example: the semilinear wave equation. In the case of the semilinear 
wave equation (2.5), the operators A and B are given by (2.6). 

2.5.1. Periodic boundary conditions. Since the Laplacian is diagonal in the Fourier 
representation, it is easy to see that the spectrum of A is given by spec A = {ifc : k £ 
Z} and that the group generated by Qo^4 is unitary on any 

y t = H i+1 {I;M) xHi{I;M) for I £ N . 
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Here Po is the spectral projection associated with eigenvalue and Qo = id — Po- 
Hence, A generates a Co-group on J^ and assumption (AO) is met. The full group 
e , however, is not unitary due to the secular term from the Jordan block of A 
when restricted to Po3^. 

Assume that the nonlincarity / of the scmilincar wave equation (2.5) satisfies 
/ G Cg(G;R) for some N e N and some open set Gel, and let V = V u x V v 
where T> u is the set Q\ from Theorem 2.12 and T> v denotes an open bounded subset 
of C-2{I)- Then, by Theorem 2.12, the nonlinearity B satisfies assumption (Bl) on 
the scale defined above with K < N. Hence, Theorems 2.4 and 2.6 give regularity 
of the flow of the semilinear wave equation on the scale y k defined above. 

2.5.2. Neumann boundary conditions. In the case of Neumann boundary conditions 
on I = [0, ■k] , we set y = Hi(I, K) x C 2 (I, R) as before; the operator A then has the 
same spectrum and e'® " 4 is again unitary. In this case, 34 = ^fc+i {I, R) x %£ b (I, R) 
with 

Hf (J,R) = {u g W fc (7,R): u (2j ' +1) (0) = u (2j ' +1) (tt) = for j = 0, . . . , [k/2\ - 1} . 

When Gc lis open and / G C^ (G; R) , assumption (BO) holds as before on the 
open bounded set T> C y defined above. We claim that (Bl) also holds for K < N. 
To prove the claim, we must show that / maps 'H\z r i{I, R) H T> u into H^ h (I, R) 
for k = 0,...,K. When k = 1, no boundary conditions need to be checked. 
When k = 2, we observe that (d x f(u))(x) = f'{u{x))u x (x) = for x = 0, w and 
u e %2 b H X>„, so /(u) £ H" b . Further, when fe = 3, . . . , K , all terms in the sum 
obtained from computing o%? f( u ) contain at least one odd derivative of u of order 
at most 2j + 1, so the boundary conditions remain satisfied. 

2.5.3. Dirichlet boundary conditions. When endowed with Dirichlet boundary con- 
ditions, A generates a unitary semigroup. We take / = [0, w] as before and set 
y k = H° k+1 (I,R) x U° k (I,R), where 

n° k (I,R) = {ue ?4(/,R): w (2j) (0) = u (2j) {tz) = for j e N with 2j < k - 1} . 

Let G C R be open with 6 G and let / G C^(G, R) as before. Then condition (BO) 
is satisfied with V = V u x V v , as before. Condition (Bl) is satisfied if /^-"(O) = 
for < 2j <K-1. 

When / does not satisfy these boundary conditions, necessary conditions for the 
existence of time derivatives take a complicated structure. To see this, it suffices 
to consider differentiability at t = 0. For U'(0) to exist, we have the obvious 
requirement that v(0, 0) = v(0, ii) = 0. For U"(0) to exist, the non-homogeneous 
boundary condition d x u(0, 0) = d x u(0, n) — — /(0) needs to be satisfied. For £/'"(0) 
to exist, <9^u(0, 0) = 9^w(0,7r) = must hold. Finally, for t/' 4 '(0) to exist, a 
straightforward computation shows that 9 4 it(0, x)+f"(0) u 2 (0, x) = /'(0) /(0) must 
hold at .t = 0, 7r. This nonlinear boundary condition is difficult to handle, and in 
this situation the space of initial conditions allowing temporally smooth solutions is 
not an open set in a suitable Hilbert space. Therefore, we restrict our attention to 
nonlinearities B of the semilinear evolution equation (2.4) which satisfy condition 
(Bl). 

2.5.4. The semilinear wave equation on the line. When i" = R, we take y = Hi(M.) x 
/^(R)- Using the Fourier transform, we verify that e is unitary on y. Lemma 
2.10 remains valid with I = R, but the assertions of Theorem 2.12 only hold true 
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provided G G and /(0) = 0. For example, when / is a polynomial without 
constant term and T> — £>^(0) for some R > 0, then B satisfies condition (Bl) and 
Theorem 2.4 applies. 

2.5.5. A semilinear wave equation in an inhomogeneous material. Instead of (2.5), 
let us consider the non-constant coefficient semilinear wave equation 

dttu = d x (ad x u) + bu + f(u) 

where a, b G C^(I; R) with a(x) > and b(x) < for x G /. For periodic boundary 
conditions and on the line, the setting and conclusions of Sections 2.5.1 and 2.5.4 
apply. For Dirichlet boundary conditions on / = [0, 7r], the spaces 34 also carry 
over from Section 2.5.3 and it is straightforward to verify that (Bl) is satisfied with 
K = 4 provided /(0) = /"(0) = and N > K. 

We remark that that the semilinear wave equation in inhomogeneous media 
can, in principle, be solved numerically by splitting methods (see the introduction 
for references). Here, however, splitting methods lose their advantage because the 
explicit computation of e is expensive for operators with non-constant coefficients. 

2.6. Example: the nonlinear Schrodinger equation. We first consider peri- 
odic boundary conditions. In this case, the Laplacian is diagonal in the Fourier 
representation with eigenvalues — k 2 and A generates a unitary group on Li{I\ C) 
and, more generally, on He(I;C) with I G No- 

In the notation of Section 2.2, we choose 3^ = H-21+i (I', C). Then (A0) is sat- 
isfied. If the potential V(u,u) satisfies V G C ] f +2+N (G;R) with K < N for some 
open subset G c M 2 = C then, by Theorem 2.12, the nonlinearity B defined in 
(2.8) satisfies assumption (Bl). Therefore, Theorem 2.4 and Remark 2.9 assert the 
existence of a flow $ on y and specify its regularity. 

In the case of Neumann boundary conditions, we choose 3^ = %2£+i(^' Q with 
I = [0, n] and %£j? +1 defined in Section 2.5.2, so that (Bl) is satisfied. 

In the case of Dirichlet boundary conditions, we choose 3^ = "H^e+i (^iQ as 
defined in Section 2.5.3. Then (Bl) is satisfied for any V(u,u) = v(\u\ 2 ) where 
v G C b +2+ (R(J";K), in particular for the standard case where V(u) = \u\ 4 /2. 

On the line, 3^ = %^+i(KjC) and condition (Bl) is satisfied if, for example, 
V(u) is a polynomial in u\ = Re(w) and ui = Im(w) with no linear term, so that 
f(0) = d u V(0,0) = 0. 

Remark 2.13. While the setup in this section concern PDEs in one spatial dimen- 
sion, our results on superposition operators can be extended to "nice" n-dimcnsional 
spatial domains since Lemma 2.10 holds on Hf(J7,R d ) for I > n/2 [1], when, e.g., 
fl C R" is a domain with smooth boundary or $7 = R™. So we could also consider 
the nonlinear Schrodinger equation on R 2 and R 3 . 

Remark 2.14 (Inhomogeneous boundary conditions). We can treat inhomogeneous 
time-independent mixed linear boundary conditions of the form BC([/) = g for the 
above examples by solving Av = 0, BC(i>) = g and then applying a Runge-Kutta 
method to U — v. This is equivalent to applying a Runge-Kutta method to U with 
boundary conditions BC(U) = g, cf. the discussion in [24]. 
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3. ^4-STABLE RUNGE-KUTTA METHODS ON HlLBERT SPACES 

In this section, we first prove an abstract convergence result for discretizations 
of evolution equations on Hilbert spaces. Then, in Section 3.2, we introduce a 
class of A-stable Runge-Kutta methods which are well-defined when applied to the 
semilinear PDE (2.4) under assumptions (AO) and (BO). In Section 3.3, we study 
the regularity of A-stable Runge-Kutta methods under the additional condition 
(Bl) and finally apply the abstract convergence result to those schemes. 

3.1. An abstract convergence theorem on Hilbert spaces. In this section 
we prove an abstract convergence result for evolution equations on Hilbert spaces, 
Theorem 3.1. Although this theorem is modeled after the basic local convergence 
result for ODEs and there are a lot of results on the convergence of time discretiza- 
tions of specific PDEs in the literature, see Section 1, we are not aware of any result 
that is as general as this theorem. 

In the classical setting of ordinary differential equations y — f(y), a one-step 
method y n+1 = ip h (y n ) is of order p if, formally, y(h) —ip h (y°) = 0(h p+1 ). In other 
words, the local error is controlled by the Taylor integral remainder of order p + 1. 
It is then easy to show that the method is globally convergent of order p; see, e.g., 
[13]. 

The situation is more subtle in the case of a differential equation 

U = F(U) (3.1) 

on a Hilbert space X: First, it is not clear whether the time-/i map ^ h associated 
with a given one-step method applied to (3.1) is well-defined as map from an open 
subset of X to itself. It depends on the equation and on the chosen one-step method, 
and typically fails for explicit Runge-Kutta methods. Second, even if U >->■ \& (t/) 
is well-defined and continuous, its derivatives with respect to h will usually fail 
to be defined on the same set. Thus, in order to control the Taylor remainder 
U(h) — ^' l ([/°) in the case of a discretization of a PDE (3.1), we must consider 
the remainder as a map from a space Z of high regularity into a space X of low 
regularity. In this setting, the usual proof that consistent one-step methods are 
convergent applies under the following assumptions. 

Let X and Z C X be Hilbert spaces, where Z is continuously embedded in X 
and let $> h be a one-step discretization of (3.1) which is of classical order p. Assume 
there exist sets T>x C X and T>z C Z such that T>x is open in X, T>z C T>x, and 
there exist constants h* > 0, 0» > 0, such that the following hold. 

(CI) For fixed h E [0,h*\, the map U (-► ^ h (U) is ^{Vx^X). Moreover, there 
exists a possibly /i-dependent norm || • \\x.h on X with 

\\U\\x<\\U\\x, h <e*\\U\\x (3.2) 

for all U € X and h 6 [0, h*] such that 

sup \\m h (U)\\ £(x)h = l + 0(h) (3.3) 

for all h g [0, ft.,]. Here, IHIg^ h denotes the operator norm induced by 

INI*.*- 
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(C2) For fixed U eV z , the map ft h4 *' l ([/) is in C p+1 ([0, K]]X), and 

sup K +1 ^(l/)|U<oo. (3.4) 

Ud-Dz 
h£[0,h,] 

Condition (CI) can be seen as a stability condition, whereas condition (C2) ensures 
consistency. 

Theorem 3.1. In the setting above, fix U° £ T>z and suppose that there exists a 
solution 

U EC({0,T};V z )nC p+1 ({0,T};V x ) (3.5) 

to the initial value problem (3.1) for some T > with U(0) = U° . Let "J' 1 be a 
one-step discretization of (3.1) of order p > 1; let U m = (^> h ) m (U°) denote the 
associated numerical solution. 

Then there exist constants ft* > 0, c\, and C2, depending only on T, the norm 
ofU inC p+1 {[0,T];X), dist x ({U(t): t £ [0,T]},&D x ), and on the constants from 
(3.2), (3.3) and (3.4), such that for every h £ [0,ft*], 

|| U m -U{mh) \\ x <c 2 e Cimh h p 
so long as mh < T. 

Proof. Since T>x is open, there is some 5 > such that Bf(U(t)) C V x for each 
t £ [0,T]. Setting 

E m = \\U m -U(mh)\\ Xth , 

we estimate, with $*(t/(s)) = U(t + s), 

E m+1 < \\* h {U m ) - * h (U{mh))\\ Xih + \\^ h {U(mh)) - $ h (U(mh))\\ Xth 

< sup \\D-$ h (U(mh) + 9 (U m ~ U(mh)))\\ £(x)h E m 
0e[o,i] k h 



e* ftp +1 

— sup 

[p + iy. s&[o,h] 



\\d p s +1 ^(U(mh))\\ x + \\d p+1 ^(U(mh))\\ x ) 



< sup \\D* h (U)\\ £ , x)h E m 

+ i-rw sup ( sup \\ d z +l * h ( u m x + \\d? +1 u(t)\\ x ) 

(P+ tJ ! *e[o,T] v he[o,h»] ' 

< (1 + ci ft) £ ra + c 3 ft p+1 . 

The suprema in the estimate above are finite due to (3.3), (3.4) and (3.5), respec- 
tively, so long as E m < S since then, due to (3.2), U(mh) + 9 (U m - U(mh)) £ V x . 
Thus, since E = 0, 



Em < c 3 hT» (1 + \ Cl)? "' 1 < * (l + ?**) m W < c 2 e— h 
ftci ci \ m J 



hP . 



Thus, we can choose ft* small enough such that E m < 5 for all m < T/h*. This 
concludes the proof. D 

Remark 3.2. The proof of Theorem 3.1 does not use any Hilbert space structure, 
so that the result holds true when X and Z are Banach spaces. However, condition 
(CI) is rather restrictive on general Banach spaces, see Remark 3.7 below and the 
discussion in the introduction. 
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3.2. Regularity of ^4-stable Runge Kutta discretizations. Applying an s- 
stage Runge-Kutta method to the semilinear evolution equation (2.4), we obtain 

W = U°l + ha (AW + B{W)) , (3.6a) 

U 1 = U° + h b T (AW + B(W)) . (3.6b) 

We write, with U E y, 

/[A {W l \ (B(W X ) 

1U = ! e y 8 > w =\ \ > B ( W ) = : 

where W 1 , . . . , W s are the stages of the Runge-Kutta method, 

s s 

(aWy = Y, a u W° , b T W = Y, bj W j , 

and A acts diagonally on the stages, i.e., (vlW) 1 = AW^ 1 for i = 1, . . . , s. 

Written this way, it is not transparent that, under certain conditions, this class 
of methods results in a well-defined numerical timc-h map ty h on a Hilbcrt space 
y. A more suitable form is achieved by rewriting (3.6a) as 

W = IL(W; U, h) = (id -haA)- 1 (1U + haB(W)) . (3.7) 

Noting that 

(id -haA)- 1 = id +haA (id -haA)- 1 (3.8) 

and inserting (3.7) into (3.6b), we obtain 

*' l (t/) = U + hb T (AW{U, h) + B(W(U, h))) 

= S(hA)U + hb T (id -haA)- 1 B(W(U, h)) , (3.9) 

where S is the so-called stability function 

S(z) = l + zb T (id-za)- 1 l. (3.10) 

We now make a number of assumptions on the method and its interaction with 
the linear operator A. First, we assume that the method is A-stable in the sense of 
[24]. Setting C~={zeC: Rez<0}, the conditions are as follows. 
(RK1) The stability function (3.10) is bounded with |S(z)| < 1 for all z e C~. 
(RK2) The s x s matrices id— za are invertible for all z 6 C~. 
Sometimes, we will also assume that a is invertible. 

Remark 3.3. The matrix id — za is invertible for all z £ C - if and only if a has no 
eigenvalues in C~ \ {0}. Its inverse is then bounded uniformly for z £ C~ by a 
constant A > 1 (insert, in particular, z = 0). 

Remark 3.4. In general, Runge-Kutta methods are called .A(0)-stable for some 
9 G [0,7r/2] if |S(z)| < 1 for all z G C with |arg(-z)| < 0; see, e.g., [13]. A 
definition of A((9)-stability that requires, in addition, invertibility of id— za was 
introduced by Lubich and Ostermann [24] in the context of parabolic equations; 
their results also depend, to a large extent, on the invertibility of a. Thus, our 
assumptions can be described as A(0)-stability for 9 = ir/2 in the sense of [24]. 
Note that the requirement 9 = n/2 arises as we include operators A which are not 
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necessarily sectorial, but whose spectrum may, for example, contain a strip about 
the imaginary axis, cf. Sections 2.5 and 2.6. 

Example 3.5. The implicit midpoint rule has stability function S(z) = (l + z/2)/(l — 
z/2), s = 1, an = hi an d bi = 1. Conditions (RK1) and (RK2) are readily verified; 
moreover, a is invertiblc. 

Lemma 3.6. Gauss-Legendre Runge-Kutta methods satisfy (RK1) and (RK2) 
with a invertible. 

Proof. Condition (RK1) is the classical notion of A-stability; it is proved for Gauss- 
Legendre methods in [13, Theorem 6.44], for example. 

To verify condition (RK2), write S(z) = P(z)/Q(z) as the quotient of polynomials 
P and Q with no common roots. We claim that Q(z) = det(id— za). To see this, 
note first that dct(id — za) arises naturally as the common denominator when solving 
for the terms of an explicit rational expansion of (id — za) -1 by Cramer's rule; see 
the proof of [13, Lemma 6.30]. The claim follows if we can show that the numerator 
does not have any factor in common with det(id— za). Since p = 2s for Gauss- 
Legendre methods [13, Theorem 6.43], dcgQ < s and deg P < s for s-stagc implicit 
Runge-Kutta methods [13, Lemma 6.30] and, generally, p < deg P + degQ [13, 
Lemma 6.4], we conclude that deg P = deg Q = s so that indeed Q(z) = det(id — za). 

Since, by (RK1), the rational function S is nonsingular on C - , all eigenvalues 
of a must lie outside of C~ \ {0}. This proves invertibility of id— za on C~, cf. 
Remark 3.3. Finally, since Q(z) = det(id— za) has degree s, a must also be nonsin- 
gular. □ 

For the convergence analysis in Section 3.3, we need the following additional 
assumption on the operator A and on the scheme. 

(Al) Assumption (A0) holds, and there exist constants ws,6s,ft* > such that 
for all h e [0, ft»] and neN, 

||S n (M)||y_ ,<e s e''* nh . (3.11) 

If assumption (Al) holds, we define, for U € y, 

\\U\\ yih = sup e-™* h \\S n (hA)U\\ y . (3.12) 

nGNo 

Then || • \\y ^ is equivalent to the y-norm in the sense of (3.2) with O* = Gs- 
Moreover, there is some a > such that 

\\S(hA)\\ £{y]h <c^ h <l + ah (3.13) 

for h e [0, ft*]. 

Remark 3.7. When an ^4-stablc Runge-Kutta is applied to discrctize a general 
Co-semigroup c tA on a Banach space, estimate (3.11) is in general false. A counter- 
example is the implicit midpoint rule applied to A = d x on £i(R) [21]. When A is 
a sectorial operator, then (3.11) is satisfied [26]. 

Remark 3.8. In the time-continuous case discussed in Section 2, the estimate corre- 
sponding to (3.11) is (2.9). Note that, by replacing the 3^-norm with the equivalent 
norm \\U\\ = sup t>0 e _w * ||e' A ?7||;y the constant in (2.9) becomes 1, analogous to 
(3.13). 
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We state the following sufficient condition for (Al), which is often satisfied in 
applications. 

(A2) Assumption (AO) holds, y is a Hilbcrt space, and A = A n + A^ with A n 
normal and Ah bounded as a linear operator on each y<j, • • • , 3^- 

(Recall that an operator A is normal if it is closed and AA* = A* A.) Condition 
(A2) implies that the non-normal part A^ of A can be included with B as it satisfies 
the sufficient condition (Bl). Note that A^ is a bounded linear operator on each 34 
if, for example, A^ = FA and A n = QA is normal, where P is a spectral projector 
of A onto a finite dimensional subspace and Q = id —P. 

Remark 3.9. In the case of the semilinear wave equation, see Section 2.5, assump- 
tion (A2) is satisfied with A\> = PoA, where Po denotes the spectral projection 
corresponding to the eigenvalue of A. In the case of the nonlinear Schrodinger 
equation, see Section 2.6, the operator A is normal, so that (A2) holds trivially. 

Lemma 3.10. Assume that (RK1) and (RK2) hold and that A satisfies conditions 
(A2). Then (3.11) is satisfied with 0$ = 1. 

Before we can prove Lemma 3.10, we need some technical estimates on the 
operators which appear on the right of equation (3.9) and (3.7). In the following, 
we denote s copies of y by y s and use the norm 



W||y. = max \\W*\\ 



j=i 



y 



Lemma 3.11. Assume (RK2) and (AO). Then, for ft,* > small enough, there 
exist A > 1 and c$ > 1 such that 

IKid-fc^)- 1 !!-,,.^.,,. <A (3.14a) 



\\haA(id -haA)- 1 \\ y „_> y , < 1 + A (3.14b) 
for all h £ [0, ft*]. Moreover, for any £, n, £ No, 

{W,h) f-4 {\&-hzA)- l W is a map of class C^^ x [0, ft*]; F) , (3.15a) 

(W, ft) M- ftaA(id -hzAy l W is a map of class C^' l) {yf x [0, ft*]; y s ) , (3.15b) 



and 



(W, ft) -4 ft(id -hzA)- x W is a map of class C^ E+1) {y s t x [0, ft*]; y s ) . (3.15c) 



Remark 3.12. Estimates of the form (3.14) were proved in [24] under the assumption 
that A is sectorial. 

Proof. Transforming a into Jordan normal form, we see that there exists a constant 
c = c(a) such that 

IKid-ftaA)- 1 ^^ < c^maxJKid-ftA^r 1 !!^ 
where Ai, . . . , Xk are the eigenvalues of a with algebraic multiplicities mi, . . . , rnk- 
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Hence, let A be one of the eigenvalues of a; we know that Re A > due to 
assumption (RK2) and Remark 3.3. Referring to (2.10), we estimate, for ft > 0, 

Thus, the right hand bound is positive and finite for all ft £ [0, ft*] provided that 
ft* > is small enough. This proves estimate (3.14a). Due to identity (3.8), 
estimate (3.14b) follows immediately. 

To prove continuity of the map (id— haA)~ 1 W : [0, ft*] — ► W for fixed W e y s , 
we proceed as follows. Let e > 0. Then for every W\ G y(, h,h' € [0, ft*], 

Wiid-haA^W - (id-tiaAy^WWys 

< IK(id-ftaA)- 1 - (id -ti a Ay 1 )Wt\\ ya + \\(id-haA)- 1 (W- Wi)\\ y . 
+ \\(id-ti a Ay 1 (W-Wt)\\ ys 

< || ((id -haA)- 1 - (id -tiaAy^Wi \\ ys + 2A || W - Wi\\ y , (3.16) 

where the second inequality is based on (3.14a). Now, since A is assumed to be 
densely defined and ^i = D(A), we can choose W\ so close to W that the last term 
on the right is less than e/2. Then, since W\ £ 3^i, there exists a 6 = 5(W\) such 
that the first term on the right is less than e/2 whenever |ft — ft'| < S. This proves 
continuity of ft i-> (id — haA)~ 1 W on the interval [0, ft*]. 

To complete the proof of (3.15), we must compute the ft-derivatives of the map 
(3.15a). Once we have shown (3.15a), estimate (3.15b) follows immediately via 
(3.8). First, 

di (id -fta.4)- 1 = i\ {aAf (id -fta^)^" 1 . 

Using estimates (2.13) and (3.14a), and noting the continuity of ft t-> (id —haA)^ 1 ]^ 
proved above, (3.15a) follows. Finally, noting that 

d h [h(id-haA)- 1 ] = (id-ftaA)- 1 + ftaA(id-ftaA)- 2 = (id-ftaA)" 2 , 

we obtain 

^[ft(id-ftaA)- 1 ] = df^ 1 (id -haA)' 2 = i\ (aA)^ 1 (id-haA)-*- 1 , 

which implies (3.15c). □ 

Proof of Lemma 3.10. Recall that Re(spccA) < u> for some u> > so that the 
spectrum of A — oj is contained in C~. Moreover, by assumption (A2) we can split 
A into a normal part A n and a bounded part At,. Now decompose A = A\ + A% 
with A\ — A n — u and Ai = A], + u>. We now apply the Runge-Kutta scheme to 
the linear problem d t U — AU in two different ways. First, we take the full A and 
B = 0; second we take A replaced by At and B(U) = A^U . Since the respective 
numerical time-ft maps given by (3.9) must be the same, we obtain the identity 

S(hA) = S(hA!) + hb T (id -ha At)" 1 A 2 W (3.17a) 

where 

W = (id-haA 1 )- l (l + haA 2 W). (3.17b) 

Rewrite (3.17b) as W = MW + G with 

M= (id-haA 1 y 1 haA 2 and G = (id -fta/li)" 1 1 . 
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Since Ai is bounded and, by Lemma 3.11, (id — haA\)~ l is uniformly bounded for 
ft G [0, ft*], the matrix M has norm smaller than one for ft G [0, ft.*] with some 
possibly smaller h* > 0. Consequently, we can solve for W = (id— M)~ 1 G, whence 
the second term on the right of (3.17a) is 0(h) in the norm of £(y). 
As Ai is normal with spec A\ C C~, we have, referring to (RK1), 

WSihA^Wy^y < SUP |S(/lA)|<l. 

AGspec A\ 

Altogether, this proves that there exists a > such that ||S(/i^4)||-v,_^-y < 1 + a h 

for ft G [0, ft.*]. This in turn implies (3.11) with 0$ = 1. □ 

Next, we describe the differentiability properties of S(hA) which will be needed 
later on. 

Lemma 3.13. Assume (RK2) 7 (A0), and either that the Runge-Kutta matrix a is 
invertible or that (AI) holds. Then there exist ft* > and c$ > 1 such that for all 
h G [0,h*\, 

\\S(hA)\\ y ^ y <c s (3.18) 

and, for all i, n G No, 

([/, h) H- S(hA)U is a map of class C^Q^ X [0, hJ-,y) . (3.19) 

Proof. First, (3.18) clearly holds when (AI) holds. To prove (3.18) when a is 
invertible, we estimate, using (3.10) and (3.14b), 

\\S(hA)\\ y ^ y <l + S ||b||]|a- 1 ||(l + A) = c s . 

Next, we show that S(hA)U : [0, ft*] — > y is continuous for every U G y as in the 
proof of Lemma 3.11, replacing (id— haA)^ 1 by S(hA), A by cs, and y s by y in 
(3.16). This proves (3.19) for t = 0. 

Finally, to prove (3.19) for £ G N, we note that, due to (3.15c), the map (W, ft) H» 
hA(id -haA)- 1 W is of class C^tyf x [0, ft*]; ^ s ); the claim then follows directly 
from the definition of S in (3.10). □ 

In Theorem 2.4, we studied differentiability in time of the semiflow $* of (2.4). 
An analogous result holds for differentiability of the discretization 1 $ h of (2.4) in 
the step size ft. 

Theorem 3.14 (Existence and regularity of numerical method, local version). 
Assume that the semilinear evolution equation (2.4) satisfies conditions (A0) and 
(Bl), and apply a Runge-Kutta method subject to condition (RK2) to it. Moreover, 
assume that (AI) holds or that the Runge-Kutta matrix a is invertible. Choose 
R>0 such that T>~ R ^ and pick U° G V~ R . Let i?* = i?/(2max{c s , A}) with c s 
from (3.18) and A from (3.14). Then, for sufficiently small ft* > 7 there exists a 
unique stage vector W and numerical time-h map ^>{U,h) = *f> h (U) which satisfy 

LF\*g f) C { h l ' e \B^(U°)x[0,K};Bf- e (U )) (3.20) 

j+k<N 
£<k<K 

for i = 1, . . . , s . In particular, 

W\*e C b A '(^f(t/°) x [0,ft*];^(C/ )) (3.21) 
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for i = 1, . . . , s. The bounds on W , ^ and h* depend only on the bounds afforded by 
(Bl), (3.14), (3.18) 7 on the coefficients of the method, R, and U . If in addition, 
(Al) holds, then there exists a constant a^, such that for h £ [0, h„] with a possibly 
smaller choice of h* > 0, 

sup \\D* h (U)\\ e(y)h <l + o-*h, (3.22) 

UGB%JU a ) 

where the norm on the left is defined by (3.12) and /i* and o~q, depend only on the 
above quantities and on the constants in (Al). 

Note that statement (3.21) for "J is analogous to (2.14a) for the semiflow $. 

Proof. We apply the contraction mapping theorem on a scale of Banach spaces, 
Theorem A. 9, to the map from (3.7), 

U(W; U, h) = (id -haA)- 1 1U + ha (id -haAy 1 B(W) 
with u = U, w = W, and fi = h on the scale Zj = y? for j = 0, . . . K. We further 
identify X = y K , Wj = B R (1U°) C y/, I = (0,ft*)> and U = intS fl .(C/°) C y K . 
To verify condition (i) of Theorem A. 9, we note that equation (3.15a) of Lemma 3.11 
asserts that the map (U, h) h+ (id — haA)^ 1 1U is, in particular, of class 

f) cf' fl (D K x[0,ft,];^). 

j+k<N 
£<k<K 

The differentiability assumptions on _B from (Bl) are precisely such that the map 
(W, h) i-»- ha (id — ZiaA) -1 S(W) is of the same class. 

First, we show that II( • ; U, h) maps Wj, j = 0, . . . , K , into itself for fixed U eU 
and h s [0, /i*] with appropriate h* > 0. We begin by taking /i* as in Lemma 3.11 
and estimate, for W € Wj, 

\\U(W;U,h)-lU°\\ z . < \\(id-(id-haA)-i)lU°\\ Zi 

+ \\(id-haA)- 1 \\ y ^ Zj \\U-U°\\ yj 

+ h\\(id-haA)- 1 a\\ y ^ Z] \\B(W)\\ Z] 

< IKid-Cid-fcaA)- 1 )!!/ !!^ + Ai?» + /iA||a|| Mj, 

(3.23) 

where, in the last step, we have used (3.14a) from Lemma 3.11 and Mj is the 
bound on B from condition (Bl). Since, again by Lemma 3.11, the map h i— > 
(id—haA)~ 1 W is continuous on each Zj, we can possibly shrink h* such that the 
right hand side of (3.23) is less than R. This proves that LT( • ; U, h) maps Wj into 
itself and implies condition (i) of Theorem A. 9. 
Next, for j = 0,...,K, 

\\T> w n(W;U,h)\\ Zi -> Zi <h\\(id -haA)- 1 a\\ z .^ z . \\DB(W)\\ Zi _+ Zj 

<hA\\a\\M'j. (3.24) 

Thus, by possibly shrinking h* again, the right hand bound can be made less 
than 1. This proves that II( • ; U, h) is a contraction on B^, (1U°) uniformly for 
U G B^ (U°) and h g [0,/i*]. Here we used that B is at least C 1 on the highest 
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rung J>k of the scale since, in condition (Bl), we require N > K. This verifies 
condition (ii) of Theorem A. 9. 

Theorem A. 9 then applies and asserts the existence of a fixed point 

WE f) C^ e) (B^(U°)x(0,K);B y J-'(UH)). 

j+k<N 
£<k<K 

Assertion (3.21) for the W % then follows from Lemma A. 2. 

To prove the corresponding estimates for ^ h , note that by (3.9), condition (Bl), 
Lemma 3.11, and Lemma 3.13 we can adapt /i* > such that for j = 0, . . . , K, 

\\* h (U) U°\\ yj < \\S(hA)(U U°)\\ y . + \\S(hA)U° U°\\ y . + h\\b\\AM j 

<R/2+\\S(hA)U -U \\ y . +h\\b\\AAI 3 <R. 

Further, the first term of (3.9) is of class (3.21) by Lemma 3.13. For the second 
term of (3.9), we note that the map £ defined as 

E(W, U, h) = /i(id -haA^BiW) , 

satisfies 

SG f| ^'((^rxfexl;^). 

i+j+k<N 
e<k<K 

Lemma A. 6 then implies (3.21) for \I/. Assertion (3.21) for ^/ then follows from 
Lemma A. 2. 

Finally, differentiating (3.9) and taking the operator norm on y, we obtain 

\\™ h (U)\\ £(y)Ji <\\S(hA)\\ £{yhh 

+ ha Qs\\b\\ IKid-^aA)- 1 !!^, \\BB(W)\\ £(ys) ||D^^(Cr,/»)|| £(3?iy . ) 

< (1+ a h) + hs e s ||b|| AM/1 II Wll (la) v 

= 1 + a* h , 

where we use (3.2), (3.13) and (3.14a), and refer to (3.21) for the bound on W. 
This proves (3.22). □ 

While this theorem gives an existence and regularity result for the numerical 
time-h map ^> h , it does not yield control over the maximum step size /i* when we 
want to define ^ h on a general open bounded domain. We address this issue in the 
following theorem which is the discrete time analogue of Theorem 2.6. 

Theorem 3.15 (Existence and regularity of numerical method, uniform version). 
Let the semilinear evolution equation (2.4) satisfy conditions (A0) and (Bl) and 
apply a Runge-Kutta method subject to condition (RK2) to it. Moreover, assume 
(Al) or that the Runge-Kutta matrix a is invertible. Choose S > small enough 
such that T>x + i is nonempty. Then there exists h* > such that (3.20) and (3.21) 
and, under assumption (Al), (3.22) hold with bounds uniform for U° G T^k+i w *^ 
R = S. Moreover, the stage vector W{U, h) satisfies 

We f) C^ £ W +1 x[0,M;K-£) (3.25a) 

j + k<N 
£<k<K+l 
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and, if a is invertible, the numerical time-h map &(U,h) = & (U) satisfies 

*e f) ^'(C^iXlO,/!,];^), (3.25b) 

j+k<N 
£<k<K+l 

In particular, when N > K + 1 . 

W j G C^ +1 (V K S +1 x [0, ft.]; D), j = l,...,s, (3.26a) 

and, i/ addition a is invertible, 

* € Cf +1 (2?^ +1 X [0, ft,] ; T>) . (3.26b) 

The bounds on W , \P and ft* depend only on the bounds afforded by (Bl), (2.18), 
(3.14), (3.18), on ifte coefficients of the method, and on 5. 

Proof. Let R = S. We apply Theorem 3.14 for each U" E £>^+r Notc tnat for 
j = 0,...,K, 

Uid-iid-haAy^lU ^ < ft max ||aA(id -saA)- 2 lU°\\ V e 

y i se[0,h] y i 

<ftA 2 ||a||i? K+1 . 

Inserting this estimate into (3.23), we see that we can choose ft, > small enough 
such that II( • ; U, ft) maps Wj{U Q ) := B R (1U°) C y* into itself for j = 0, . . . , K, 
and, from (3.24), such that II is a contraction on Wj(U°) uniformly for U° G T>] <+1 , 
U e B^ +1 (U°), and ft e [0, ft*], where R* = R/(2 max{c s , A}). As in the proof of 
Theorem 3.14, we find that (3.20), (3.21) and (3.22) hold with uniform bounds in 
U° e V K S +1 , and that 

W\*e f) C$ i) (pjf +1 x[0,h m ]i'D k -t) (3.27) 

j+k<N 
£<k<K 

for i = 1, . . . , S. 

To prove that W actually maps into a space one step up the scale, we show that 
AW l & f| C ( h l ' e \v K 5 +1 x[0,K] ; y k . e ) (3.28) 

j+k<N 
l<k<K 

for i = 1, . . . , s. We apply A to (3.7), so that 

AW = A(id -fta^)- 1 !?/ + ftaA(id -haA)- 1 B{W) . (3.29) 

The first term of (3.29) is of class (3.28) by Lemma 3.11. For the second term, 
we note that, by (Bl) and (3.15b), 

E(W, U, ft) = ftaA(id -haA)- 1 B(W) 

is of class 

EG f) C^\{-D k ) s x y K+1 x X; y|_,) , 

i+j+k<N 
l<k<K 

so that (SoW)' is of class (3.28) for i = 1, . . . , s by Lemma A. 6. This proves (3.28). 
To prove that, for a invertible, A\I/ is also of class (3.28), we proceed analogously. 
Applying A to (3.9), we obtain 

AV h (U) = S{hA)AU + ftb T A(id -haA)~ 1 B{W{U, ft)) . (3.30) 
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The first term on the right of (3.30) is of class (3.28) by Lemma 3.11. We already 
proved above that V = E o W is of class (3.28). As a is invertible, b T a _1 y and, 
hence, (3.30) are of class (3.28). 

Next, we show improved regularity of W and \& with respect to the step-size in 
the same way as in the proof of Theorem 2.6. Namely, we prove that 

d h W\ d h V G f) cfVjf+i x [0, &*];?*_*) . 

j+k<N—l 

e<k<K 
Consider the (K + l)-scale of Banach spaces Zj = y? for j = 0, . . . , K and -Z^+i = 
y K with Wj = V] ioTJ = 0,...,K and Wjr+1 = V K . Set « = P A 5 +1 , X = y K+1 , 
and X = (0, /i*). Due to (3.15c) and (Bl), the map n from (3.7) satisfies the 
assumptions of Theorem A. 9 in this setting. This shows that dhW is of the above 
class, and proves, with (3.27), (3.28) and Lemma A. 3 claim (3.25a) for the stage 
vector W. Then (3.15c), Lemma 3.13, (3.25a), Lemma A. 6 and Lemma A. 7 applied 
to 

d h V h (U) = d h S(hA)U + b T (id -haA)- 2 B(W(U, h)) 

+ hb T (id -haA)- 1 DB(W(U, h)) d h W(U, h) , 

imply that dh^ is of the same class as dhW 1 . When a is invertible, then, using that 
A^ is of class (3.28) and using Lemma A. 3 as before, claim (3.25b) follows. 

Statements (3.26a) and (3.26b) are, as before, a consequence of Lemma A. 2. □ 

Remark 3.16. We actually showed in Theorem 3.15 that for a invertible 

j+fe<JV 
£<k<K 

i = 1, . . . , a, i.e. W l and VP have slightly higher regularity in U° than the semiflow 
$*. 

Remark 3.17 (Image of the numerical method). Analogous to the situation for the 
semiflow noted in Remark 2.8, the proof of Theorem 3.15 actually shows that, when 
a is invertible, 



Moreover, 



* e c<— ' V^+i x P. r.];Djc n y K+1 ) 



*£ f) C b W (V +1 x[0,T.];P l 

j + fc^AT 

£<fc<if+l 

(fc,f)#(if+l,0) 



Remark 3.18. The proof of Theorem 3.15 shows that, when (A0), (Al), (Bl) and 
(RK2) hold, but a is not assumed invertible, we still have 

*e p| C { h l - k+1 \v K s +1 x[0,TJ;V). 



j+k<N-l 
k<K 



Remark 3.19. If A generates a group rather than a semigroup, we may assume 
(Al) for h 6 [— /i*,/i*] for some h* > 0. Then Theorems 3.14 and 3.15 hold with 
h G [— h*,h*\ (for some, possibly, smaller choice of h* > 0). Moreover, we can 
then also weaken the requirement in (RK1) to \S(z)\ < 1 for z £ iR and still show 
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that (A2) implies (Al) for h G [—h*, h*]. In this setting, the proof of Lemma 3.10 
proceeds by recalling that, according to Remark 2.9, there exists ui > such that 
Rc(spccA) < uj. Hence, we can decompose QA into a skew-symmetric operator 
Ai = Im(A n ) and a bounded operator A-i = A^ + Re(A n ). 

3.3. Convergence analysis of yi-stable Runge Kutta methods. In this sec- 
tion we present a convergence analysis of A-stable Runge-Kutta methods applied 
to semilinear evolution equations (2.4). The main difficulty is to prove differentia- 
bility in the step size h of the implicitly defined Runge-Kutta methods as maps 
from a space of functions with higher regularity to a space with lower regularity. 

Theorem 3.20 (Convergence). Apply a Runge-Kutta method of classical order p 
subject to conditions (RK2) and (Al) to the semilinear evolution equation (2.4). 
Assume further that (Bl) holds with K > p. Pick 5 > such that T>~ +1 is non- 
empty and T > 0. Then there exist positive constants /i», Ci, and ci which only 
depend on the bounds afforded by (Bl) and (Al), (3.14), on the coefficients of the 
method, and on 5, such that for every U° with 

{&(U°):te[o,T\}cv£ 1 (3.31) 

and for every h G [0, h*], the numerical solution (^ h ) m (U°) lies in T> and satisfies 

\\($ h ) m (U°) - $ mh (U°)\\ y < c 2 e cimh h p 
so long as mh < T . 

Proof. We invoke Theorem 3.1 with Z = y p+ i, V z = £>~+i, X = y, 

V X = |J B y R (U)dV 

where R = S, and note that dist x ({U(t): t G [0,T]),dT> x ) > 5. To verify the 
assumptions of the theorem, we first note that local existence and regularity of a 
solution to the evolution equation (2.4) in the appropriate spaces is always guar- 
anteed by Theorem 2.6. In particular, for initial data U° such that (3.31) holds, 
we also have U G C([0,T];D^ 1 ) n C p+1 ([0,T}; V~ s ) with uniform bounds in the 
norms of both spaces. Conditions (CI) and (C2) follow from Theorem 3.15 and 
Remark 3.18. □ 

Remark 3.21. As explained in Sections 2.5 and 2.6, the semilinear wave equation 
and the nonlinear Schrodingcr equation satisfy the assumptions of Theorems 3.14— 
3.20 provided the nonlinearity is sufficiently smooth. 

In the following corollary we prove the convergence of the [/-derivatives of the 
numerical solution. 

Corollary 3.22 (Convergence of derivatives). In the setting of Theorem 3.20 there 
exist positive constants h*, Ci, and c-i which only depend on the bounds afforded by 
(Bl) and (Al), (3.14), on the coefficients of the method, and on 8, such that for 
every U satisfying (3.31) and for every h G [0, h*\ 

\\D^ h ) m (U°) - %$ mh (U°)\\ £Jiyp+i y) < c 2 ^ mh hP 
for j < N — p — 1 so long as mh < T . 
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Proof. We proceed by induction over j. The case j = is already asserted by The- 
orem 3.20. When j > 0, we note that U(t) = (U(t), W(t)) = ($*([/°), D$*([/°)W/°) 
satisfies 

-U(t) = AU + B(U) (3.32) 



where 



dt 
A= {o A) ' ^)= (dB(17)wJ ' 



and we take W° G Bk = int(Si (0)). Similarly, the Runge-Kutta method applied 
to (3.32) satisfies 

*^°) = ( D ** h ($)U) Wh6re ^ = (^ 

Equation (3.32) and the Runge-Kutta method applied to it again satisfy (Al), (Bl) 
with N replaced by N — 1 and 2?fc replaced by T>k X % for fc = 0, . . . , K and (RK2). 
We can therefore apply the induction hypothesis to the extended system so long as 
j+p<N-l. □ 

Appendix A. Contraction mappings on a scale of Banach spaces 

In the appendix we present a contraction mapping theorem on a scale of Banach 
spaces, our main technical tool. Our results are more general than precursor ver- 
sions in [31, 33]. The proofs are technically involved for two reasons. First, there 
is some combinatorial complexity in the estimates due to the implicitness of the 
fixed point of the contraction map. For this reason we decided to derive estimates 
in all required norms at once. Second, the maps we consider have derivatives with 
respect to the parameters that are only strongly continuous, but not continuous in 
the operator norm. This precludes a straightforward induction argument. What 
we find is that this weaker notion of continuity is entirely sufficient, but requires 
some extra care and notational effort. 

For K £ No, let Z = Z$ D Z\ D ... D Zk be a scale of Banach spaces, each 
continuously embedded in its predecessor, and let Vj,Wj C Zj be nested sequences 
of sets. Let X be a Banach space, and let U C X and Ictbe open. We note 
that all results in this section easily extend to the case where X is an open subset 
of W. Without loss of generality, we may assume that ||w||.z < ||to||.z. +1 for all 
w G Zj+i ■ (If this is not the case, we inductively equip Zj+i with the equivalent 
novm\\-\\ Zj+1 + \\-\\ Zj .) 

We use the following additional integer indices. The minimal regularity we guar- 
antee for the image space of the function considered is the regularity of the lowest 
scale index L of the image, the loss index S indicates how many rungs on the scale 
the range of a function is down relative to its domain, and N denotes the maximal 
regularity of the function. We assume < L < K — S < N — S. Taking the 
dependence on parameters into account, we work with the family of spaces 

Cjv.jf.i.sttV,-}, U,l; {Wj}) = f) cJf-'Ws xUxl; W fc _,) , 

i+j+k<N-S 



L+i<k<K-S 



endowed with norm 



HTTlll mnv ITV TV r)*TTll 

L+l<k<K-S 
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for < L < K - S < N - S, and abbreviate 

C N , K , L ({V 3 }M,I; {Wj}) = C NtK>Lt0 ({Vj},U,Z; {W,-}) , 
C JV ,jc({V ; -},Z/,2:;{W i })=C JViJ f i o,o({V i },W,I;{W i }) 

with corresponding norms 

|n|jv,if = lin||jv,/-c,o,o • 

Note that any function of class Cn,k,l,s has a maximal number of N — L — S 
derivatives in its first and second argument on the lowest admissible domain scale 

Zl+s- 

Furthermore, let 

C NjKjL {U,1; {Wj}) = f| C { ^\U x 1- W k - e ) , 



j+k<N 
L+i<k<K 



endowed with norm 



L+E<k<K 

for < L < K < N, where we abbreviate 

C NtK (U,l;{W j })=C N , Kt0 (U,l;{W j }) 

with corresponding norm 

IMIj\r,jr = \\ w \\n,k,o- 
For future reference, we note the following. 

Remark A.l. When a map II £ Cjv,a\l,s({Vj},W,X; {VV,}) does not depend on w, 
it can be interpreted as an element from Cn,k,l(M,I', {Wj}) where 

|n||| N,K,L,S = W"-\\n-S,K-S,L- 

Wc simply write Cm,k,l.s and Cm.k,l when the arguments arc unambiguous. We 
also write 

9^n(ty(«,/Lt);«,^) = d l Jl{w;u : ix)\ w=w{u ^ = (d,Jlow)(u,fi) 

to denote partial //-derivatives vs. D m (II(w(m, /x), u, /j,)) to denote full /i-derivatives. 
Wc begin with four short technical lemmas. The first specifies the relation be- 
tween the spaces Cn,k and C . 

Lemma A. 2. If N > K then, with W = W , 

C NtK (U,l; {Wj}) c Cg{U xX-W). 

Proof. Let w £ Cn,k(M,Z; {Wj}). Fixing £ = k in the definition of Cn,k and 
recalling (2.3b), i.e., strong and uniform continuity coincide if no derivative in u is 
taken, we find that 



G Q ' ) (W x I; W) n p) C;- J (W xl;W). 

j+£<A'+l 
£<A 

The claimed uniform continuity then holds because of (2.3a). □ 
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The following lemma captures captures the essence of the inductive step in N as 
needed in the main results which follow. 

Lemma A. 3. If w G Cjv jf i(W,I;{VVj}) and the map (u,u,ii) <-> T> u w(u, fj,)u is 
of class Cn,k,l{U x B*{0),I; {Zj}), then w G Cn+i.k.l{U,1\ {Wj}) and 

\\w\\n+1,K,L ^ SU P ll D « w «IU,if,L + ll w llAr,K,L- 

ll«IU<i 

Proof. The claim is a direct consequence of the partitioning of the index set in the 
definition of the (N + 1, K, L)-norm, see (A.l), into 

{0 < j + k < N + 1} = {0 < j + k < N} U {0 < j + k < N} 

where j = j ' — 1, using the definition of the operator norm, 

W T We(x,y) = SU P \\ Tx \\y> 
IMU=i 

and the definition of the (N, K, L)-norm (A.l). D 

The next lemma captures the essence of the inductive step in K. Namely, a scale 
of length K + 1 can be broken up into two scales which have only length K, plus a 
trivial remaining bit. 

Lemma A.4. When N > K, w G C NtK+ltL +i{U,l\ {Wj}) nC N , L , L (U,l; {Wj}), 

and d^w G Cn-i,k,l(U,Z\ {Zj}), then w G Cnj<+ix{U,I] {Wj}) and 



\ w \\n,k+i,l ^ W w \\n,k+i,l+i + \H\n,l,l + W d M 



N-1,K,L ■ 



Proof. Translating the scale, i.e., setting Zj — Zj + l, K = K — L, and N = N — L, 
we can reduce to the case L = 0. Since 

{0<£<k<K + l} = {0<£<k<K + l}U{l<e<k<K + l}U{k = £ = 0} 

and d^w G Cn~i,k if and only if 

f) C { h bi+1) (Uxl;W k -i)= f) C$ A (Uxl;Wk-l), 

j + k<N-l j+k<N 

Kk<K K£<k<K+l 



It' 



the claim follows directly from definition of Cn,k,l and its norm (A.l). □ 

Finally, we prove that the space Cn,k,o,S can be expressed in terms of Cjv^-type 
spaces with domains defined on a scale. 

Lemma A. 5. We have 

f) C n _ s ,k-s,l(Vk x U,X; {Wj}) = C N , K ,L,s{{V 3 },U,l- {Wj}) , 

S<k<K 

and 

l n lc N ,K,i„s({V J },W;I;{n' j }) ~ s^^W^Cn s, k -s.l(V k xU,X:,{Wj}) ' 

where ~ denotes that left hand and right hand sides provide equivalent norms on 
Cn,k,l,s- 
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Proof. Translating the scale, i.e., setting Zj = Zj+l, K = K — L, and N = N — L, 
we can reduce to the case L = 0. Next, we identify 

f| C N - S , K -s(V K x U;l, {Wj}) = f| C { h l ' i] ((V K x U) x Z;W fc _/) 

S<k<K S<k<K 

j+k<N-S 
£<k<K-S 

,(I,:M) 



f| C'fKxWxl^w) 



S<k<K 

i+j+k<N-S 
e<k<K-S 



,(i,J,Z) 



D C b \V- k+s xUxl;W k -e) 



0<k<K-S 

i+j+k<N-S 
Kk<k 



f| ^'(VnsxWxIiWw), 



i+j+k<N-S 
£<k<K-S 



which equals Cn,k,o,S- Noting that 



j+k<N-S 
£<k<K-S 

~ S<£<K H D «' D ™^A' n ll£oo(V K xWxI;£ i (2: K ;£J(Ar;2 fc _ f )) ' 
l<k<n-S 

the statement about the norms follows analogously. □ 

The next lemma will be our main tool for obtaining estimates on the scale of 
Banach spaces for compositions of maps of the form 

(n o w){u, /i) = n(w(u, p)] u, p) . 

The essence of the result is very natural: When the outer function II loses S rungs 
on the scale, the inner function w must have minimal regularity L = S and the 
composition maps at best into rung K — S. 

The main difficulty in the proof of this lemma and of the subsequent results 
is that the maps considered lose smoothness when derivatives in \x are taken. In 
particular, these derivatives are only strongly continuous with respect to the param- 
eters u and n and, in our infinite dimensional setting, are discontinuous with respect 
to u and /i in the operator norm. As a result, in the proofs below the induction 
hypothesis can not be applied to the derivatives in a straightforward way. 

Lemma A. 6 (Chain rule on a scale of Banach spaces). Let n = Ii(w]u,ii) and 
w = w(u, n) satisfy 

neC^K^siiWj}^,!;^}) and w<=Cn !K! s + l(U,I;{Wj})- 
Then How <E Cn-s.k-s,l{M,T; {Zj}) and ||IIo w\\n-S,k—S,l can be bounded by a 
polynomial with non-negative coefficients in \\H\\n.k.l,s an d H'HIat.-R'.s+l- 

Proof. Translating the scale, i.e., setting Zj = Zj + l, K = K — L, and N = N — L, 
we can reduce to the case L = as in the proof of Lemma A. 4. We proceed by 
induction in N and K as follows. 



A-STABLE RUNGE-KUTTA METHODS FOR SEMILINEAR EVOLUTION EQUATIONS 33 

For N = K = S, we have IT G C b (W s xUxl; Z ), w G C h (U x 1; Ws), hence 
Ilowe Cb(U,I; Zq) with bound ||n|s,s,o,S- 

Let us now increment N holding K and S fixed. Let B = 23* (0). We claim that 
the map 

(«,ii,/i)i-)D„(IIo«i)ii is of class Cn-s,k-s(U x B,X;{Zj}) (A. 2) 

with a bound which is a polynomial in |n|Ar +1 x,o .s an d |w||A r +i--R"-S- The induc- 
tive step is achieved by Lemma A. 3 which then asserts that II o w € Cn+i-s,k-s 
with its norm bounded as required. 
To prove this claim, let u G £>, write 

D„(n o w)(u, /u) u = (9 u II(i(j(u, /i); u, /x) u + 9 u ,n(u;('u, /i); u, /it) T> u w{u, n) u , (A. 3) 

and consider each term on the right of (A. 3) separately. For the first term on the 
right, set u = (u, u) G U x B = IA and define 

III (to; u, fi) — d u Tl(w] m, /i) u . (A. 4) 

By assumption, this map is of class Cjv,if,o,s({VVj},W x B,X; {Zj}) . The induction 
hypothesis, applied to the maps IT and w, then asserts that 

nio»e C N - S ,K-s{U x B,l;{Zj}) (A.5) 

and that its Cjv— s,.k"— s-norm is bounded by a polynomial with nonnegative coeffi- 
cients in inliv+i^cs > |||Hi |||jv,ic,o,S and ||u)||jv,if,s- 

For the second term on the right of (A. 3), we must proceed in stages. Fix 
r = \\w\\n+i,k,s and l et V K = B^ K (0). For k = S, . . . ,K and (u, w) G U x V Kl we 
set 

H2(w; (u, w), jj,) = D u ,n(-u;, u,/x)w. 

By assumption, this map is of class Cn,k,o,s({Wj},U x V k ,X; {Zj}). The induction 
hypothesis, applied to the maps II2 and w, then asserts that 

n 2 owe C N - S , K -s(U x V K ,1; {Z 3 }) (A.6) 

and that its Civ_s iK _s-norm is bounded by a polynomial in HwHat^s > ||«>||jv,k,s 
and 

l n IU+l,K,0,S SU P ¥"ttZ K - l n 2| Cjv „ s ({W,},«xV B ,l;{2,}) ■ 
w£V K 

We now consider the composition II2 o w as a map 

n(w); u, n) = d w Ii(w(u, (J,); u,fj,)w. 

Recalling that (A.6) applies for all k = S,...,K, we can apply Lemma A.5 to 
obtain that 

neC Ar ,i f ,o, 1 sr({V J -},W > I;{^}) 
and that its norm is bounded by a polynomial in r \P1\In+i,k,o,S and || tf || jv. j£T.s- 
(This is summarized in Lemma A. 7 for later use.) 

Now consider II as a function of w, u = (u, u) <EU x B, and fi. Since 

\\Vuw(u,n)u\\ Cb{ u x X: Zj ) ^ HliV+i,K,S \\u\\ x <r 
for j = S,...,K, the function w(u, /J,) = D u w(u, [i) u is of class Cn,k,s(J^ x 
B,I; {Vj}). Applying the induction hypothesis to II and w, we conclude that IToty 
or, written explicitly, the map 

((u, u), n) i-)- d w IL(w(u, fj,); u, //) D u w(u, fi) u 
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is of class Cn-S,k-s(M x S,Z; {.Zj-}), with norm bounded by an increasing poly- 
nomial in |n|jv+i,-K',o,S and ||w||jv+i,-K",S > ||tw||jv,jc,5. Due to (A. 3), (A. 4), and 
(A. 5), this also holds for the map ((u, u),n) *— > D u Ii(w(u, fi); u, fi) u, thus proves 
our claim (A. 2); the inductive step in N is complete. 

Next, we increment K — S keeping N fixed. Here the inductive step will be 
achieved by Lemma A. 4; we must hence verify its assumptions. First, applying the 
induction hypothesis on the scale Zj = Zj+\ with j = 0, . . . , K, we infer that 

n o w £ C N -i- S ,K-s{U,I\ {•%}) = Cn-s,k+i-s,i(U,Z', {Zj}) 

with the corresponding norm bounded by a polynomial with nonnegative coefficients 
in |n|jv',Ar+i,o,s > |||II||| Ar, J ftr+i,i,S'-i-i and \\w\\ N j <+1 , s +i- Second, by the induction 
hypothesis applied on the trivial scale, 

LTo w e Cjv-s,o(W,T;{^}), 

with the corresponding norm bounded by a polynomial with nonnegative coefficients 
in |II||jv',a: + i,o ) s > ||n|jv,o,o,s,o and ||to|[jv,if+i,s > ||^||iV,o,S- Third, we claim that 

D M (II o w ) € br-i-s.K-siU,!', {Zj}) , (A.7) 

with the corresponding norm bounded by a polynomial with nonnegative coefficients 
in |||n|||jv,_fs"+i,o.s and ||w||jv,x+i,S- Then Lemma A. 4 applied to LTo-u; where N and 
K there correspond to N — S and K — S here proves that now £ Cn-s.k+i-s 
with the required bound on its norm; this concludes the inductive step. 

It remains to prove claim (A.7). Following the steps in the estimate of D„(IIom;) 
above, we write 

D M (n o w)(u, fi) = <9 M n(w(w, /i); u, fj) + d w Il(w(u, /i); u, /i) D M w (A. 8) 

and consider each term on the right of (A. 8) separately. For the first term, note 
that the assumption on II implies, in particular, that c^II £ Cn.k+i.o,s+i and 
that, by assumption, w £ Ca^a'+i.s+i- Since K — S is not increased, the induction 
hypothesis applies to this pair of maps and yields 

9 M n o w £ C N -s-\,k-s(U, I; {Zj}) (A.9) 

with a polynomial bound in ||II||jv,if+i,o,s > l^nliv.iir+i.cs+i and ||to||jv,i<:+i,S > 
\\w\\n,k+i,s+i- 

For the second term on the right of (A. 8), fix r = ||w||jv,i<:+i,s and let Vj = 

B r 3 (0) for j — S, . . . , K + 1. We saw above that the map II from (A. 11) is of class 
Cn-i,k,o,s({Vj}M,Z; {Zj}) with norm bounded as specified in Lemma A.7. The 
assumption on w and the definition of r above imply, moreover, that 

D M tueC JV _ M r,s(W ) :r;{V i }). 

Thus, the induction hypothesis applied once more to this pair of maps yields 

ft o d^w = D w (n o w)D l ,w £ C N -is, K -s(U,I', {Zj}) (A.10) 

with a polynomial bound in |n|jv,K+i,o,s and ||w||jv,k:+i,s > \\dn w \\N,K,s- To- 
gether, (A.9) and (A. 10) imply (A.7) with the required bound. □ 

In the proof of Lemma A. 6, we implicitly proved the following result which we 
state here for later reference. 
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Lemma A. 7. Let IT and w satisfy the conditions of Lemma A. 6 with L = 0; let 
or 7=0,..., K . Then 

ll(w; u, (i) = T) w Il(w(u, /x); u, ff) w (A. 11) 



r > and Vj = Br 3 (0) for j = 0,...,K. Then 



satisfies 

neC^_i, Xl0 ,s({V ; -} > W,I;{^}) 

with a polynomial bound in ||w||jv-i, .K^s an d r ||n|jv,i<",o.S- 

Remark A. 8. The Faa di Bruno formula (see, e.g., [10]) can be used to compute 
the derivatives of compositions of functions explicitly. However, it does not remove 
the need to estimate complete Cn,k norms. Thus, an inductive argument seems to 
be the most manageable way of writing out a proof. 

We now proceed to the crucial contraction mapping theorem for maps II ( • ; u, /i) 
of class Cn,k- 

Theorem A. 9 (Contraction mappings on a scale of Banach spaces). ForN,K G No 
with N > K , let Z = Zq D Z\ D ... D Zk be a scale of Banach spaces, each 
continuously embedded in its predecessor, let Wj C Zj be a nested sequence of 
closures of open sets, let X be a Banach space, and let U C X and Iclie open. 
Let (w, U, fJ,) <-> n(ui; u, /i) be a nonlinear map such that 

(i) neC N ,K({Wj},U,l',{Wj}); 

(ii) w <— > H(w; u, (J,) is a contraction on VVj with contraction constant c' < 1 
uniformly for all u £ IA, /i£ I, and j = 0, . . . , K . 

Then the fixed point equation Tl(w; u, /x) = W has a unique solution 

u;eC w ,A-(W,I;{Wj}) 

and \\w\\n,k is bounded by a function which is a polynomial with non-negative 
coefficients in \\H\\n,k and (1 — c') — . 

Similar theorems were proved in [31] for the case K = 1, U = and in [33] 
for the case N = K 6 N, U = 0. Due to Lemma A. 2, the theorem as stated 
here implies, in particular, that w £ C^ilA x X;W). This simple statement on 
C K differentiability is reminiscent of the standard form of the contraction mapping 
theorem with parameters as, for example, stated in [19, p. 13]. 

Proof of Theorem A. 9. The argument is once more an induction in N and K, fol- 
lowing the combinatorial pattern of the proof of Lemma A. 6. For N = K = 0, the 
regular contraction mapping theorem with parameters asserts that w € C(lAxI; W). 
Moreover, II is a contraction uniformly for (u, fj.) S c\(U) x cl(I) so that n has a 
unique fixed point iy(u*,/x*) also for (it*,/x*) on the boundary of U x I. From 
this, a straightforward estimate yields continuity of w up to the boundary; thus, 

Assume now that the conclusion of the theorem holds for fixed K and N > 
K . We first employ Lemma A. 3 to show that the conclusion also holds when we 
increment N, holding K fixed. 

As in the proof of Lemma A. 6, we set B = B*(0) and let (u,u) £ U x B =U. 
Differentiating the fixed point equation w = LT o w with respect to u, we find that 
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T) u w it formally solves the fixed point equation w = Yl(w; (u, u), /i), where 
Tl(w; (u, u), fi) = d w Tl(w(u, jjl); u,fj,)w + d u TL(w(u, /i); u, //) u 
= II(u); u, n) + d u TL(w(u, /j,);u,/j,)u. 
Using the chain rule Lemma A. 6 and Lemma A. 7, we infer that 

ILeC N , K ({V j },hlxB,l;{Z j }) 

with Vj = B r J (0) for j = 0, . . . , K and arbitrary r > 0. Here, we must prove in 
addition that II maps each of the Vo, ■ ■ • , Vr- into itself. Indeed, a direct estimate 
shows that it suffices to take 

iiiTTii, 1 ^ W d ^ow\\ Cao{UxIS{xz)) 
T = ft mm is a max > max ; — . 

N + 1 > K > S j=Q K 1-c'j ~ j=0,...,K 1 - c] 

The induction hypothesis then applies tofle Cn,k{{Vj},U X B,X;{Vj}), yielding 
the existence of a fixed point w € Cn,k(U x B,X;{Vj}). 

It remains to be shown that the formal identity w = T) u w u holds true on each 
Zj for j = 0,...,K. This, however, follows by [33, Theorem 4.8] (see also the 
proof of [31, Theorem 3]) applied to the one-parameter family of maps (w\v) t-> 
H(w; u + vu, fi) for fixed /i £ I, h £ W, and i£6on the scale {Zq, Z\\ = {Zj, Zj} 
for each j = 0, . . . , K. 

Altogether, since w <G Cn,k{U,I]{Wj}), Lemma A. 3 applies and yields gives 
w <G Cjv+i,if(W,X; {VVj}); the inductive step in N is complete. 

Next, we increment K < N holding N fixed. For this, we use Lemma A. 4. First, 
we note that assumptions (i) and (ii) holds on the K-step scale Z\ D • • • D Zk+i 
so that the induction hypothesis applies; we find that 

w€C NiK+l!l (Uxl;{Wj}). 

Second, by the induction hypothesis applied on the trivial scale, w € Cjv.o- Third, 
differentiating the fixed point equation w = II ow with respect to [i, we obtain that 
d^w formally solves the fixed point equation w = Yl(w; u, /i), where 

Tl(w; u, fi) — d w TI(w(u, /«); u, /j,) w + d f Jl(w(u, /«); u, /i) 
= tl(w; u, fi) + d^U(w(u, fi);u, fj) ■ 
Since, by assumption, w G Cpjj<, we infer from Lemma A. 6 and Lemma A. 7 that 

nec JV _i,*({v ; -} > WxB ) :r;{z,-}). 

Here, we need in addition that II maps each Vo, • • • , Vk into itself. This is satisfied 
whenever 

IIITTIII 1 ^ ll^n^ll^^xl;^) 

r = n In r-+i <? max '-, t ^ max i ; ■ 

N ' K + l ' b j=0,...,K 1 - dj j=0,...,K 1 - c' 

The induction hypothesis then applies to II e Cjv-i,jf({V,-}, UxB,X; {Vj}), yielding 
the existence of a fixed point w £ Cn-i.k{U x B,X\ {Vj}). By [33, Theorem 4.8] 
(see also the proof of [31, Theorem 3]), applied to (w; fi) *-> H(w;u,fi) for each 
fixed u G U on the two-step scale {Z , Z{} = {Zj,Zj + i} for each j = 0, . . . ,K, we 
ensure that the formal identity w = d^w holds true across the scale Z , . . . Zk- We 
conclude that d^w £ Cn~i,k- 
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Altogether, Lemma A. 4 applies and yields w £ Cn,k+i', the inductive step in 
K is now complete. We note that the required polynomial bounds are obtained, 
as before, by carefully tracking all the bounds in the respective norms through the 
argument. We omit all detail. □ 
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